%% Cell type:markdown id: tags:
# _Residual Selection for Consistency Based Diagnosis Using Machine Learning Models_
by Erik Frisk <erik.frisk@liu.se> and Mattias Krysander <mattias.krysander@liu.se>
%% Cell type:markdown id: tags:
Code corresponds to the paper "_Residual Selection for Consistency Based Diagnosis Using Machine Learning Models_" published at IFAC Safeprocess 2018 in Warsaw, Poland.
Note that the plots are not identical to the results in the paper, where a Matlab implementation of the machine learning algorithms was used. However, the methodology is the same and the results are similar.
%% Cell type:markdown id: tags:
## Basic python imports
%% Cell type:code id: tags:
``` python
import numpy as np
import matplotlib.pyplot as plt
from diagutil import BoxOff
import diagutil as du
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import confusion_matrix
```
%% Cell type:markdown id: tags:
## Load the data
%% Cell type:markdown id: tags:
The data is loaded into a dictionary with four fields:
* modes - an array with the names of the no-fault and fault modes
* res - an array with the 42 residuals
* mode - a vector indicating which fault is active at each sample
* fsm - a fault signature matrix based on the model structure
%% Cell type:code id: tags:
``` python
data = du.loadmat('../data/data.mat')['data']
nf = len(data['modes'])
nr = data['res'].shape[1]
```
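%% Cell type:markdown id: tags:
As a quick sanity check of the structure described above, the fields and their shapes can be printed directly (a minimal sketch; the shape comments assume the data file accompanying the paper, with 42 residuals):
%% Cell type:code id: tags:
``` python
# Inspect the loaded dictionary: mode names, residual matrix,
# mode vector, and fault signature matrix.
print('modes:', data['modes'])
print('res:', data['res'].shape)    # (number of samples, 42 residuals)
print('mode:', data['mode'].shape)  # one mode label per sample
print('fsm:', data['fsm'].shape)    # (42 residuals, number of modes)
```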
%% Cell type:markdown id: tags:
# Preprocess data
%% Cell type:markdown id: tags:
Preprocess the data in two steps:
1. Take absolute values of the residuals (absdata)
2. Threshold the data (thdata)
The data is normalized so that a threshold at 1 corresponds to a false alarm probability of approximately 1%.
%% Cell type:code id: tags:
``` python
absdata = data.copy()
absdata['res'] = np.abs(absdata['res'])
thdata = absdata.copy()
thdata['res'] = thdata['res'] >= 1
```
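%% Cell type:markdown id: tags:
The stated normalization (threshold 1 giving roughly 1% false alarm probability) can be checked empirically on the no-fault data. A minimal sketch, assuming mode index 0 is the no-fault class NF:
%% Cell type:code id: tags:
``` python
# Fraction of no-fault samples above the threshold, per residual;
# values should be in the vicinity of 0.01 if the normalization holds.
nf_res = absdata['res'][absdata['mode'] == 0, :]
fa_rate = np.mean(nf_res >= 1, axis=0)
print('False alarm rate per residual: min %.3f, mean %.3f, max %.3f'
      % (fa_rate.min(), fa_rate.mean(), fa_rate.max()))
```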
%% Cell type:markdown id: tags:
Plot the first 7 residuals in all fault modes. Residuals plotted in red are supposed to alarm for the fault according to the fault signature matrix.
(Fig. 4 in the paper)
%% Cell type:code id: tags:
``` python
plt.figure(10, clear=True, figsize=(10, 10))
for ri in range(7):
    for fm in range(nf):
        plt.subplot(7, 8, ri*nf + fm + 1)
        if absdata['fsm'][ri, fm]==0:
            plt.plot(absdata['res'][absdata['mode']==fm, ri], 'b', lw=0.3)
        else:
            plt.plot(absdata['res'][absdata['mode']==fm, ri], 'r', lw=0.3)
        plt.gca().tick_params(labelsize=6)
        plt.ylim(0, 3)
        BoxOff()
        if fm==0:
            plt.ylabel('res-%d' % (ri+1), fontsize=8)
        if ri==0:
            plt.title(absdata['modes'][fm], fontsize=8)
plt.tight_layout(w_pad=-0.75, h_pad=0)
```
%% Cell type:markdown id: tags:
# Basic analysis - performance of all 42 residuals
%% Cell type:code id: tags:
``` python
ts = np.zeros((nr, nf))
for ri in range(nr):
    for fm in range(nf):
        Nfm = np.sum(absdata['mode']==fm)
        Nalarm = np.sum(absdata['res'][absdata['mode']==fm, ri]>=1)
        ts[ri, fm] = Nalarm/Nfm
```
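%% Cell type:markdown id: tags:
The matrix ts holds the empirical alarm rate of each residual in each mode. It is not used further below, but a side-by-side comparison with the structural fault signature matrix gives a quick view of which structurally sensitive residuals actually respond in the data (illustrative sketch; the 0.5 alarm-rate cut-off is an arbitrary choice, not from the paper):
%% Cell type:code id: tags:
``` python
# Structural sensitivity (fsm) vs. empirical detections (alarm rate >= 0.5)
plt.figure(figsize=(6, 8))
plt.subplot(1, 2, 1)
plt.spy(data['fsm'], marker='o', markersize=3)
plt.title('Structural sensitivity')
plt.subplot(1, 2, 2)
plt.spy(ts >= 0.5, marker='o', markersize=3)
plt.title('Alarm rate >= 0.5')
plt.tight_layout()
```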
%% Cell type:markdown id: tags:
Plot the ideal fault isolability matrix corresponding to the fault signature matrix.
(Fig. 6 in the paper)
%% Cell type:code id: tags:
``` python
im = du.IsolabilityMatrix(data['fsm'])
plt.figure(20, clear=True, figsize=(6, 6))
plt.spy(im[1:, 1:], marker='o', color='b')
# nf-1 fault modes remain after excluding NF
plt.xticks(np.arange(nf-1), data['modes'][1:])
plt.yticks(np.arange(nf-1), data['modes'][1:])
plt.title('Isolability matrix')
plt.gca().xaxis.tick_bottom()
```
%% Cell type:markdown id: tags:
Compute consistency based diagnoses and the corresponding confusion matrix based on all 42 thresholded residuals. The confusion matrix should be compared with the ideal fault isolability matrix above.
(Fig. 7 in the paper)
%% Cell type:code id: tags:
``` python
_, C = du.DiagnosesAndConfusionMatrix(thdata)
plt.figure(30, clear=True, figsize=(6, 6))
du.PlotConfusionMatrix(C)
plt.xticks(np.arange(nf), data['modes'])
plt.yticks(np.arange(nf), data['modes'])
plt.title('Fault Isolation Performance matrix')
plt.tight_layout()
```
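%% Cell type:markdown id: tags:
The diagnosis logic is encapsulated in du.DiagnosesAndConfusionMatrix. The underlying consistency test is the standard one: a fault mode is a diagnosis for a sample if every triggered residual is structurally sensitive to that mode. A minimal sketch of that test (an illustration only, not the implementation in diagutil):
%% Cell type:code id: tags:
``` python
def consistent_modes(alarms, fsm):
    """Return a boolean vector: mode fm is consistent with the alarmed
    residuals if no triggered residual has a zero in column fm of the
    fault signature matrix."""
    return np.array([not np.any(alarms & (fsm[:, fm] == 0))
                     for fm in range(fsm.shape[1])])

# Example: diagnosis statement for the first thresholded sample
print(consistent_modes(thdata['res'][0], data['fsm']))
```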
%% Cell type:markdown id: tags:
## Test selection using Random Forest Classifiers
%% Cell type:markdown id: tags:
First, build a random forest classifier based on the thresholded data. Here, the ensemble consists of 300 trees.
%% Cell type:code id: tags:
``` python
rf = RandomForestClassifier(n_estimators=300)
rf.fit(thdata['res'], thdata['mode'])
sortIdx = np.argsort(rf.feature_importances_)[::-1]
```
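%% Cell type:markdown id: tags:
The ranking induced by the variable importances is what drives the residual selection below; it can be listed directly (an illustrative check; residuals are numbered from 1 as in the plots):
%% Cell type:code id: tags:
``` python
# Residual numbers (1-based) in decreasing order of importance
print('Full ranking:', sortIdx + 1)
print('Top 12:', sortIdx[:12] + 1)
```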
%% Cell type:markdown id: tags:
Plot the confusion matrix for the random forest classifier on the training data.
(Fig. 8 in the paper)
%% Cell type:code id: tags:
``` python
s = np.diag([1/sum(thdata['mode']==mi) for mi in range(nf)])
C = s@confusion_matrix(thdata['mode'], rf.predict(thdata['res']))*100
plt.figure(31, clear=True, figsize=(6, 6))
du.PlotConfusionMatrix(C/100)
plt.xticks(np.arange(nf), data['modes'])
plt.yticks(np.arange(nf), data['modes'])
plt.title('Fault Isolation Performance matrix')
plt.tight_layout()
```
%% Cell type:markdown id: tags:
Plot the variable importance, sorted, to get a ranking of predictor/residual usefulness in the classifier. Note that this classifier is not meant to be used in the diagnosis system.
(Fig. 10 in the paper)
%% Cell type:code id: tags:
``` python
plt.figure(40, clear=True, figsize=(9, 6))
plt.plot(rf.feature_importances_[sortIdx])
plt.yticks(fontsize=8)
plt.xticks(range(nr), sortIdx+1, fontsize=8, rotation=90)
plt.xlabel('Predictors')
plt.ylabel('Importance')
plt.title('Predictor importance')
BoxOff()
```
%% Cell type:markdown id: tags:
Compute performance measures for false alarm (FA), missed detection (MD), aggregated fault isolation (FI), and the probability of maximum isolability performance (FI-max) when selecting residuals according to the ranking computed above.
\begin{align*}
p_{\text{FA}} &= 1 - P(NF\in D|NF)\\
p_{\text{MD}} &= \frac{1}{n_{f}}\sum_{f_{i}\in \tilde{\mathcal{F}}} P(NF\in D|f_{i})\\
p_{\text{FI}} &= \frac{1}{n_{f}^{2}}\sum_{f_{i}\in \tilde{\mathcal{F}}} P(NF\notin D|f_{i})\sum_{f_{j}\in \tilde{\mathcal{F}}}|P(f_{j}\in D|f_{i})-I_{ij}|\\
p_{\text{FI-max}} &= P(D=F_{f_i}|f_i)
\end{align*}
where $F_{f_i}$ is the set of faults not structurally isolable from fault $f_i$.
%% Cell type:code id: tags:
``` python
pfa = np.zeros(nr-1)
pmd = np.zeros(nr-1)
pfi = np.zeros(nr-1)
pmfi = np.zeros((nr-1, nf))
# Make sure the isolability matrix for NF corresponds to the diagnosis statement
# computed by DiagnosesAndConfusionMatrix
imk = du.IsolabilityMatrix(data['fsm'])
imk[0] = np.zeros(nf)
imk[0, 0] = 1
for k in range(1, nr):
    dx, C = du.DiagnosesAndConfusionMatrix(thdata, residx=sortIdx[0:k])
    pfa[k-1] = 1-C[0, 0]
    pmd[k-1] = np.mean(C[1:, 0])
    pfi[k-1] = np.mean(np.diag(1-C[1:, 0])@np.abs(C[1:, 1:]-im[1:, 1:]))
    for fi in range(nf):
        pmfi[k-1, fi] = np.mean(np.all(dx[thdata['mode'] == fi] == imk[fi, :], axis=1))
```
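%% Cell type:markdown id: tags:
The residual-set sizes examined below are marked on the curves in the next figure. A purely illustrative heuristic for flagging candidate sizes programmatically could look as follows (the tolerance is an arbitrary assumption, not part of the paper):
%% Cell type:code id: tags:
``` python
# Smallest number of residuals for which each aggregated measure
# drops below an (arbitrary) tolerance.
tol = 1e-3
for name, p in [('FA', pfa), ('MD', pmd), ('FI', pfi)]:
    below = np.where(p <= tol)[0]
    if below.size > 0:
        print('%s <= %g first reached with %d residuals' % (name, tol, below[0] + 1))
    else:
        print('%s never reaches %g' % (name, tol))
```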
%% Cell type:markdown id: tags:
Plot the three aggregated performance measures against the number of selected residuals.
(Fig. 11 in the paper)
%% Cell type:code id: tags:
``` python
num_res = [10, 12, 26, 27]
plt.figure(50, clear=True, figsize=(9, 7))
plt.plot(range(1, nr), pfa, 'r', label='False alarm probability')
plt.plot(range(1, nr), pmd, 'b', label='Missed detection probability')
plt.plot(range(1, nr), pfi, 'y', label='False isolation probability')
for ni in num_res:
    # pfi[k-1] corresponds to k selected residuals
    plt.plot(ni, pfi[ni-1], 'kx')
plt.legend()
plt.xlabel('Number of selected residuals')
plt.ylabel('Probability')
BoxOff()
```
%% Cell type:markdown id: tags:
Plot the probability of maximum fault isolation performance for each fault.
(Fig. 12 in the paper)
%% Cell type:code id: tags:
``` python
plt.figure(figsize=(10, 10))
for k in range(nf):
    plt.plot(pmfi[:, k], label=thdata['modes'][k])
plt.legend(loc='upper right')
BoxOff()
```
%% Cell type:markdown id: tags:
Compute and display confusion matrices corresponding to selecting 10, 12, 26, and 27 residuals. The results should be compared to the confusion matrix above where all 42 residuals were used.
(Fig. 13 in the paper)
%% Cell type:code id: tags:
``` python
plt.figure(80, clear=True, figsize=(12, 12))
for k, ni in enumerate(num_res):
    _, C = du.DiagnosesAndConfusionMatrix(thdata, residx=sortIdx[0:ni])
    plt.subplot(2, 2, k+1)
    du.PlotConfusionMatrix(C)
    plt.title('No of tests: %d' % ni)
    plt.xticks(np.arange(nf), data['modes'])
    plt.yticks(np.arange(nf), data['modes'])
plt.tight_layout()
```
%% Cell type:markdown id: tags:
Compare the performance when using 12 residuals with that of all 42 residuals.
%% Cell type:code id: tags:
``` python
ntests = 12
plt.figure(90, clear=True, figsize=(12, 12))
plt.subplot(1, 2, 1)
_, C = du.DiagnosesAndConfusionMatrix(thdata, residx=sortIdx[0:ntests])
du.PlotConfusionMatrix(C)
plt.title('No of tests: %d' % ntests)
plt.xticks(np.arange(nf), data['modes'])
plt.yticks(np.arange(nf), data['modes'])
plt.subplot(1, 2, 2)
_, C = du.DiagnosesAndConfusionMatrix(thdata)
du.PlotConfusionMatrix(C)
plt.title('No of tests: 42')
plt.xticks(np.arange(nf), data['modes'])
plt.yticks(np.arange(nf), data['modes'])
plt.tight_layout()
```
%% Cell type:code id: tags:
``` python
```