In network management, fault management is the set of functions that detect, isolate, and correct malfunctions in a telecommunications network and compensate for environmental changes. These functions include maintaining and examining error logs, accepting and acting on error detection notifications, tracing and identifying faults, carrying out sequences of diagnostic tests, correcting faults, reporting error conditions, and localizing faults by examining and manipulating database information.[1]
When a fault or event occurs, a network component will often send a notification to the network operator using a protocol such as SNMP. An alarm is a persistent indication of a fault that clears only when the triggering condition has been resolved. The problems currently occurring on a network component are often kept in an active alarm list, such as the one defined in RFC 3877, the Alarm MIB. Most network management systems also maintain a list of cleared faults.[2]
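The relationship between active and cleared alarms can be illustrated with a minimal sketch. It is not an implementation of the Alarm MIB; the class name `AlarmTable`, its methods, and the choice to identify an alarm by a (source, condition) pair are assumptions made for this example only.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class Alarm:
    source: str     # hypothetical component name, e.g. "router1"
    condition: str  # hypothetical condition identifier, e.g. "linkDown"
    severity: str   # severity label attached when the alarm was raised


class AlarmTable:
    """Toy active-alarm list with a history of cleared alarms."""

    def __init__(self) -> None:
        self.active: Dict[Tuple[str, str], Alarm] = {}
        self.cleared: List[Alarm] = []

    def raise_alarm(self, source: str, condition: str, severity: str) -> Alarm:
        # Raising the same condition again updates the existing entry
        # instead of creating a duplicate, so the alarm stays persistent.
        alarm = Alarm(source, condition, severity)
        self.active[(source, condition)] = alarm
        return alarm

    def clear_alarm(self, source: str, condition: str) -> bool:
        # An alarm leaves the active list only when its condition clears;
        # it is then moved to the cleared history.
        alarm = self.active.pop((source, condition), None)
        if alarm is None:
            return False
        self.cleared.append(alarm)
        return True
```

In this sketch, a notification reporting a fault would call `raise_alarm`, and the matching "resolved" notification would call `clear_alarm`, which returns `False` if no such alarm was active.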
Fault management systems may use complex filtering rules to assign alarms to severity levels. These levels can range from debug to emergency, as in the syslog protocol.[3] Alternatively, a system could use the perceived severity field of the ITU X.733 Alarm Reporting Function, which takes the values cleared, indeterminate, critical, major, minor, or warning. The latest version of the syslog protocol draft under development within the IETF includes a mapping between these two sets of severities. It is considered good practice to send a notification not only when a problem has occurred, but also when it has been resolved. The latter notification would have a severity of cleared.
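A translation between the two severity scales can be sketched as a lookup table. The syslog severity names and numeric codes below are the standard ones; the specific pairing in `ILLUSTRATIVE_MAP`, along with the names `ILLUSTRATIVE_MAP` and `to_syslog`, is an assumption for illustration and should be checked against the relevant specification before use.

```python
from enum import IntEnum


class SyslogSeverity(IntEnum):
    """Syslog severity levels; a lower number is more severe."""
    EMERGENCY = 0
    ALERT = 1
    CRITICAL = 2
    ERROR = 3
    WARNING = 4
    NOTICE = 5
    INFORMATIONAL = 6
    DEBUG = 7


# Illustrative pairing only (an assumption, not quoted from a standard).
ILLUSTRATIVE_MAP = {
    "critical": SyslogSeverity.ALERT,
    "major": SyslogSeverity.CRITICAL,
    "minor": SyslogSeverity.ERROR,
    "warning": SyslogSeverity.WARNING,
    "indeterminate": SyslogSeverity.NOTICE,
    "cleared": SyslogSeverity.NOTICE,
}


def to_syslog(perceived_severity: str) -> SyslogSeverity:
    """Translate an X.733 perceived severity name to a syslog severity."""
    return ILLUSTRATIVE_MAP[perceived_severity.lower()]
```

A table like this is lossy in one direction: two perceived severities share a syslog level here, so the original X.733 value would need to be carried separately if it must be recovered.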
A fault management console allows a network administrator or system operator to monitor events from multiple systems and act on this information. Ideally, a fault management system should be able to identify events correctly and respond automatically, either by launching a program or script that takes corrective action, or by activating notification software that lets a human intervene (e.g. by sending an e-mail or an SMS text message to a mobile phone). Some notification systems also have escalation rules that notify a chain of individuals based on their availability and the severity of the alarm.
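An escalation rule of the kind described above can be sketched as a walk along an ordered chain of contacts. The `Contact` fields, the `SEVERITY_RANK` ordering, and the function `pick_recipient` are hypothetical names chosen for this example; real notification products define their own rule formats.

```python
from dataclasses import dataclass
from typing import List, Optional

# Assumed ordering of alarm severities, least to most severe.
SEVERITY_RANK = {"warning": 1, "minor": 2, "major": 3, "critical": 4}


@dataclass
class Contact:
    name: str
    available: bool    # e.g. on shift and reachable
    min_severity: str  # lowest severity this person should be notified about


def pick_recipient(chain: List[Contact], severity: str) -> Optional[Contact]:
    """Return the first contact in the chain who is available and whose
    threshold is met by the alarm severity, or None if nobody qualifies."""
    rank = SEVERITY_RANK[severity]
    for contact in chain:
        if contact.available and rank >= SEVERITY_RANK[contact.min_severity]:
            return contact
    return None


chain = [
    Contact("on-call engineer", available=False, min_severity="warning"),
    Contact("team lead", available=True, min_severity="major"),
    Contact("operations manager", available=True, min_severity="critical"),
]
```

With this chain, a critical alarm skips the unavailable on-call engineer and goes to the team lead, while a minor alarm finds no eligible recipient and would have to be queued or handled by some other policy.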
1. "What is fault management? - Definition from WhatIs.com". Retrieved 2015-10-06.
2. "What Is Fault Management? A Definition & Introductory Guide". XpoLog Log Analysis, Management & Viewer. 2020-04-07. Retrieved 2020-11-15.