Feature #4222

Implement periodical detection and reporting of anomalies to system admins

Added by Jan Mach 12 months ago. Updated 10 months ago.

Status: Closed
Start date: 07/27/2018
Priority: Normal
Due date:
Assignee: Jan Mach
% Done: 100%
Category: Development - GUI
Target version: 2.1

Description

The anomaly report could contain items such as:
  • List of the most common inspection errors in the last day
  • Examples of unclassified events or events without an assigned severity
  • List of detectors that have been silent for longer than usual
  • List of accounts that are pending activation
    etc.
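
As an illustration of the "silent detectors" item, the check could be sketched roughly as follows. This is only a minimal sketch under stated assumptions: the function name, the input dictionaries, and the tolerance factor are all hypothetical and are not part of Mentat. It assumes that the last event time and a typical gap between events are already known for each detector.

```python
from datetime import datetime, timedelta


def find_silent_detectors(last_seen, now, usual_gap, tolerance=3.0):
    """Return sorted names of detectors that have been silent for too long.

    last_seen -- dict: detector name -> datetime of its last event (hypothetical input)
    usual_gap -- dict: detector name -> timedelta of its typical gap between events
    tolerance -- assumed multiplier; silence longer than tolerance * usual gap is flagged
    """
    silent = []
    for name, seen_at in last_seen.items():
        gap = usual_gap.get(name)
        if gap is None:
            # No baseline for this detector, so it cannot be judged.
            continue
        if now - seen_at > gap * tolerance:
            silent.append(name)
    return sorted(silent)
```

With a tolerance of 3, a detector that usually reports hourly would be listed once it has been quiet for more than three hours. The right threshold would have to be tuned per deployment.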

Try to think about other useful information that might come in handy to system administrators.

Associated revisions

Revision 2586e789
Added by Jan Mach 10 months ago

Initial version of periodical anomaly detection scripts.

I have added prototype periodical anomaly detection scripts created by Pavel Kácha to the Mentat package. So far these are just simple shell scripts intended to be executed periodically via cron; they use the psql utility to query the database and send the result via email. A possible improvement is to wrap them in a Mentat script so that they can use the common configuration. (Redmine issue: #4222)
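
The query-and-mail pattern described above could look roughly like the sketch below. It is not the code from this revision (the committed scripts are shell scripts). It is a minimal Python illustration, and all function names, the SMTP host, and any database name, query, or addresses passed in are assumptions. Only the psql flags (-X, -A, -t, -d, -c) and the standard library calls are real.

```python
import smtplib
import subprocess
from email.message import EmailMessage


def build_psql_command(database, query):
    """Build a psql invocation that prints bare, unaligned rows for one query."""
    # -X: skip ~/.psqlrc, -A: unaligned output, -t: rows only, -c: run a single command
    return ["psql", "-X", "-A", "-t", "-d", database, "-c", query]


def build_report_email(subject, body, sender, recipient):
    """Wrap the query output in a plain-text email message."""
    msg = EmailMessage()
    msg["Subject"] = subject
    msg["From"] = sender
    msg["To"] = recipient
    # Send an explicit note instead of an empty mail when the query returns nothing.
    msg.set_content(body if body.strip() else "No anomalies found.")
    return msg


def run_check(database, query, subject, sender, recipient, smtp_host="localhost"):
    """Run one check and mail its result; meant to be called from a cron job."""
    result = subprocess.run(
        build_psql_command(database, query),
        capture_output=True,
        text=True,
        check=True,  # raise if psql exits with an error
    )
    msg = build_report_email(subject, result.stdout, sender, recipient)
    with smtplib.SMTP(smtp_host) as smtp:  # assumes a local mail relay
        smtp.send_message(msg)
```

A script that calls run_check with a concrete query could then be scheduled from a crontab entry, in the same way as the shell scripts.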

Revision 9e970a1c
Added by Jan Mach 10 months ago

Added documentation section about system administration.

The current focus of this section is Mentat system health monitoring. (Redmine issue: #4222, #3361)

History

#1 Updated by Jan Mach 12 months ago

  • Description updated (diff)

#2 Updated by Jan Mach 12 months ago

  • Target version changed from Future to 2.1

#3 Updated by Jan Mach 10 months ago

  • Status changed from New to In Progress
  • % Done changed from 0 to 50

I have added periodical anomaly detection scripts created by Pavel Kácha to the Mentat package. A possible additional improvement is to wrap them in Mentat script modules so that they can use the framework features and common configuration.

#4 Updated by Jan Mach 10 months ago

  • Status changed from In Progress to Closed
  • % Done changed from 50 to 100

I have extended the documentation with a page about Mentat system administration, and monitoring in particular. Part of the section describes the usage of the currently implemented database sanity monitoring scripts that were added with this issue. There is still a lot of room for improvement; however, I now consider this issue resolved, because the current implementation is sufficient for our use cases and there are more important issues to work on. Further improvements can be handled as separate issues.
