Search |
MonitorCode and View updates for sites/index.phpThis work is in preparation for asking sites to update their PCU information for Monitor. I've created a better view for sites/index.php. It lists all nodes and allows techs or PIs to associate nodes with a PCU. Additionally, the organization of the code is split more clearly between a template (standard html with minimal php variables, and loops) and control function. I believe this will make it easier to integrate with additional ajax code later. I will request OneLab for comments on the code reorganization, using sites/index.php as an example.
Monitor SuspendedMonitor has been suspended this week, due to operational setbacks. At the beginning of the week the database server was experiencing extreme load, and resulting in timeouts or failed boots by remote sites, complicating support for Monitor tickets. Rather than generate additional traffic, I postponed running monitor again. The mis-match of the PlanetLab-branch.tar.gz bundle that's downloaded during a 'rins' was fixed for I2 nodes (of which there are many), so that after a rins 'codemux' is correctly installed. It's not clear how this happened other than just a stale version of PlanetLab-branch.tar.gz that didn't get updated to the 4.1 version. Monitor Stats OverviewLet Monitor loose on over 200 nodes this morning. Nodes in some states are not acted on: debug, down < 7 days, with a PCU. There were 60 sites with unresponded tickets before, and now there are 97 tickets in RT actively managed by Monitor, representing outstanding issues with a Site not just a node. sites_today: 52 sites_total : 97 Site Assist and Monitor
I've added Site Assistant docs that describe some of the features/policy currently implemented in Monitor as well as what will come from Monitor in the days to come.
July 16-20, 2007
July 16-20,2007
July 2-6th, 2007
June 29, 2007
Faiyaz pointed out that the NotifyPersons() api call may be better suitted to sending Email to Techs/PIs etc, than the EMail aliases anyway. I will look into that after the first round. June 15, 2007Monitor I've added a 'monitor' queue to RT support list in order to catch bounced messages and be a central list for the correspondence with site managers. This is currently only visible to me in the RT GUI. Finishing the monitor code for debug and down sites. The processing moved from node-based to site-based, grouping all complaints into a single email in order to be more friendly. I've added blacklists to prevent certain sites from every being considered regardless of whether they are RT tickets or not. As well the code differentiates clearly from the 'diagnosis' of a node's problem and the 'action' taken on it. |
PlanetLab loginAnnouncements
|