% root/cvsroot/UserCode/Friis/TancNote/note/tanc_nn_training.tex
% Revision: 1.1
% Committed: Wed Mar 31 01:22:23 2010 UTC by friis
% Content type: application/x-tex
% Branch: MAIN
% Log Message: Slow but steady

The samples used to train the TaNC neural networks are typical of the signals
and backgrounds found in common physics analyses using taus. The signal--type
training sample is composed of reconstructed tau--candidates that are matched to
generator level hadronic tau decays coming from simulated $Z \rightarrow
\tau^{+}\tau^{-}$ events. The background training sample consists of reconstructed
tau--candidates in simulated QCD $2\rightarrow2$ hard scattering events. For
both signal and background samples, 20\% of the events are reserved as a
statistically independent sample to evaluate the performance of the neural nets
after the training is completed. The TaNC uses the ``MLP'' neural network
implementation provided by the TMVA software package, described in~\cite{TMVA}.
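
The reservation of a statistically independent evaluation sample can be sketched as follows. This is a minimal illustration with hypothetical event lists and a hypothetical helper name; it is not the actual production workflow.

```python
import random

def split_train_eval(events, eval_fraction=0.2, seed=42):
    """Reserve a fraction of events (20% by default) as a statistically
    independent sample for evaluating performance after training.
    Hypothetical helper; `events` is any sequence of event records."""
    events = list(events)
    # Shuffle deterministically so the split is reproducible
    random.Random(seed).shuffle(events)
    n_eval = int(len(events) * eval_fraction)
    # Returns (training sample, independent evaluation sample)
    return events[n_eval:], events[:n_eval]
```

The fixed seed makes the partition reproducible, and the two returned samples are disjoint by construction.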
The signal and background samples are split into five subsamples corresponding
to each reconstructed decay mode. An additional selection is applied to each
subsample by requiring a ``leading pion'': either a charged hadron or gamma
candidate with transverse momentum greater than 5 GeV$/c$. A large number of
QCD training events is required, as the leading pion selection and the
requirement that the decay mode match one of the dominant modes given in
Table~\ref{tab:decay_modes} are both effective discriminants. For each
subsample, 10000 signal and background tau--candidates are reserved to be used
internally by the TMVA software to test for over--training. The number of
signal and background entries used for each decay mode subsample is given in
Table~\ref{tab:trainingEvents}.
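
As an illustration, the ``leading pion'' requirement can be expressed as a simple predicate. The data layout here is hypothetical: each tau--candidate is assumed to carry lists of charged hadron and gamma transverse momenta in GeV$/c$.

```python
def passes_leading_pion(charged_hadron_pts, gamma_pts, threshold=5.0):
    """True if the candidate has a "leading pion": any charged hadron or
    gamma constituent with transverse momentum above 5 GeV/c.
    Hypothetical data layout: two lists of constituent pT values."""
    return any(pt > threshold
               for pt in list(charged_hadron_pts) + list(gamma_pts))
```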
\begin{table}
\centering
\begin{tabular}{l|c|c}
\multirow{2}{*}{Decay Mode} & \multicolumn{2}{c}{Training Events} \\
& Signal & Background \\
\hline
$\pi^{-}$ & blah & blah \\
$\pi^{-}\pi^0$ & blah & blah \\
$\pi^{-}\pi^0\pi^0$ & blah & blah \\
$\pi^{-}\pi^{-}\pi^{+}$ & blah & blah \\
$\pi^{-}\pi^{-}\pi^{+}\pi^0$ & blah & blah \\
\hline
Total number of events & blah & blah \\
\hline
Training preselection efficiency & blah & blah \\
\end{tabular}
\caption{Number of events used for neural network training for each
selected decay mode.}
\label{tab:trainingEvents}
\end{table}
The neural nets use as input variables the transverse momentum and $\eta$ of the
tau--candidates. These variables are included because their correlations with
other observables can increase the separation power of the ensemble of
observables. For example, the opening angle in $\Delta R$ for signal
tau--candidates is inversely related to the transverse momentum, while for
background events the correlation is very small (see~\cite{DavisTau}). In the
training signal and background samples, there is significant discrimination
power in the $P_T$ spectrum. However, any systematic dependence of the neural
network output on $P_T$ and $\eta$ is undesirable, as a physics analysis using
the TaNC algorithm will in general have $P_T$ and $\eta$ spectra different from
those of the training samples. The dependence on $P_T$ and $\eta$ is therefore
removed by applying a $P_T$-- and $\eta$--dependent weight to the
tau--candidates when training the neural nets.
The weights are defined such that the weighted two--dimensional $P_T$--$\eta$
distributions are identical for the signal and background samples after the
``leading pion'' and decay mode selections have been applied. The weights are
defined as
\begin{align}
W(P_T, \eta) &= \min\left(p_{sig}(P_T, \eta),\, p_{bkg}(P_T, \eta)\right)\\
w_{sig}(P_T, \eta) &= W(P_T, \eta)/p_{sig}(P_T, \eta) \\
w_{bkg}(P_T, \eta) &= W(P_T, \eta)/p_{bkg}(P_T, \eta)
\end{align}
where $p_{sig}(P_T, \eta)$ and $p_{bkg}(P_T, \eta)$ are the normalized
two--dimensional $P_T$--$\eta$ distributions of the signal and background
samples, respectively.
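
Concretely, the reweighting can be sketched with binned distributions. This is a minimal sketch, not the actual implementation: the function name, array inputs, and binning are hypothetical, and the bin-by-bin minimum plays the role of the lesser-of-two-densities function above.

```python
import numpy as np

def compute_weights(pt_sig, eta_sig, pt_bkg, eta_bkg, pt_edges, eta_edges):
    """Per-bin training weights that equalize the signal and background
    pT-eta distributions (hypothetical binning; inputs are 1D arrays)."""
    # Normalized two-dimensional pT-eta distributions p_sig and p_bkg
    p_sig, _, _ = np.histogram2d(pt_sig, eta_sig,
                                 bins=(pt_edges, eta_edges), density=True)
    p_bkg, _, _ = np.histogram2d(pt_bkg, eta_bkg,
                                 bins=(pt_edges, eta_edges), density=True)
    # W is the bin-by-bin lesser of the two densities
    W = np.minimum(p_sig, p_bkg)
    # Guard against empty bins when dividing
    with np.errstate(divide="ignore", invalid="ignore"):
        w_sig = np.where(p_sig > 0, W / p_sig, 0.0)
        w_bkg = np.where(p_bkg > 0, W / p_bkg, 0.0)
    # By construction both weighted distributions now equal W bin by bin
    return w_sig, w_bkg
```

By construction the weights satisfy $0 \le w \le 1$, and the weighted signal and background distributions both equal $W$, so neither net can exploit residual $P_T$ or $\eta$ shape differences during training.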
\fixme(workingpoint)