ViewVC Help
View File | Revision Log | Show Annotations | Root Listing
root/cvsroot/UserCode/Vuko/Notes/ElectronID/Variables.tex
Revision: 1.3
Committed: Fri Jun 20 00:46:07 2008 UTC (16 years, 10 months ago) by beaucero
Content type: application/x-tex
Branch: MAIN
Changes since 1.2: +15 -15 lines
Log Message:
Last Change for tonight

File Contents

# User Rev Content
1 beaucero 1.1 Electron is identified by combining information from the CMS
2     tracker and ECAL. Our initial (preselection) requirement is that
3     the electron candidate in addition to the kinematics criteria
4     described in the Section above, is also identified as a GSF electron,
5     {\it i.e.}, it has a GSF track matched with the ECAL Super Cluster (SC).
6     The efficiency of the selection criteria is measured with respect
7     to these initial requirements.
8    
9     The focus of this note is to optimize official loose and tight identification
10     criteria to identify electrons from $Z$ and $W$ decays, respectively.
11     This can be achieved by optimizing the thresholds, optimizing the
12     discriminants that have more background rejection power, and selecting
13     the variables that are not highly correlated with each other.
14     The latter allows keeping the number of variables in the criteria to a
15     minimum, which results in a simpler set of requirements with smaller
16     systematic uncertainties, and thus more robust in the startup conditions.
17    
18     Variables that allow discriminating electrons from em-jets can be roughly
19     divided into two classes: matching/shower-shape and isolation
20     discriminants. In the following we treat these two classes separately to
21     follow the existing egamma POG identification scheme. The variables are
22     described in the next two subsections.
23    
24     \subsection{Identification variables}
25     \label{ss:matching}
26     \subsubsection{Track-ECAL matching}
27     An electron GSF track should be well-matched to the ECAL Super Cluster (SC), while
28     a $\pi^0$ energy deposit in ECAL is not necessarily matched well with a GSF track, which
29     is usually belong to a charged pion. Thus, a spatial match between a track and a SC can be a good
30     discriminant. This match is described in azimuthal and pseudorapidity planes and denoted as
31 kkaadze 1.2 $\Delta\phi$ and $\Delta\eta$, respectively.
32 beaucero 1.1
33 beaucero 1.3 \subsubsection{E/p-based variables}
34     Electron should deposit all of its energy in the ECAL detector, thus the track momentum
35     at the outer edge of tracker $p_{out}$ should be of the same order as an energy of the seed EM
36     cluster $E_{seed}$. Jet is reconstructed combining information from CMS tracker and hadronic calomitere (HCAL). It interact with ECAL much less intensively and most of its energy, carried after tracker, is deposited in HCAL. Hence, $p_{out}$ and $E_{seed}$ differ significantly for the jets and can be used to discriminate them from electrons.
37     We also considered an official version of this discriminant $E_{SC}/p_{in}$, where $E_{SC}$ is a SC energy, and $p_{in}$ is the initial momentum of a charged particle.
38    
39 beaucero 1.1 \subsubsection{ECAL energy width}
40     An electron brems extensively in CMS tracker, which results in a rather wide shower in azimuthal
41     plane due to a strong CMS magnetic field. However, the width of a shower in pseudorapidity
42 kkaadze 1.2 plane remains very narrow and can discriminate against jets, which tend to create rather large
43     clusters in both $\eta$ and $\phi$ directions. We consider two parameterization of the shower width:
44     the one, used in official electron identification developed by egamma POG, $\sigma_{\eta\eta} = \sqrt{CovEtaEta}$ which describes the width of the highest-energy basic cluster of the SC, further referred
45     to as a seed cluster, and an energy-weighted width of SC in $\eta$ direction, defined as
46 beaucero 1.1 \begin{equation}
47     \label{eq:etawidth}
48     \sigma_\eta = \frac{1}{E_{SC}} \sqrt{\sum_{ECAL~SC~RecHits} E_i(\eta_i - \eta_{SC})^2}.
49     \end{equation}
50    
51     \subsubsection{H/E variables}
52     One can form a powerful discriminant by using the HCAL and ECAL energies associated
53     with an electron candidate, as electrons tend to deposit very little or no energy in HCAL,
54 kkaadze 1.2 while jets produce a wide energy deposition in the HCAL. An variable used in official
55     ``Robust" as well as ``Loose" and ``Tight" criteria is the ratio of HCAL and ECAL energies: $H/E$.
56     It peaks around 0 for electrons and can be quite large for em-jets. We form a variable that also
57 beaucero 1.3 takes into account the width of the ECAL energy deposition by making a ratio of the energy deposited
58 beaucero 1.1 in HCAL and ECAL in the cone of $\Delta R < 0.3$ which is not included in the SC, normalized
59     to the SC energy as follows
60    
61     \begin{equation}
62     \label{eq:emhad}
63 kkaadze 1.2 Iso_{EmHad} =\frac{1}{E_{SC}}\left(\sum_{\Delta R=0.3}E^{ecal}_{RecHit} +
64 beaucero 1.1 \sum_{\Delta R=0.3}E^{hcal}_{RecHit} - E_{SC}\right).
65     \end{equation}
66    
67     \subsubsection{Track isolation requirements}
68     As electrons from $W$ and $Z$ boson decays are isolated, requiring electron isolated from tracking
69     activity can significantly suppress em-jets that usually have a large number of soft tracks around the
70     leading $\pi^0$. It also can suppress real electrons from semi-leptonic decays of $b$ quarks which
71     tend to be non-isolated as well. We consider several versions of the track isolation requirements:
72    
73     \begin{equation}
74     \label{eq:trkIsoN}
75     IsoN_{trk} = \frac{1}{p_T(e)}\left(\sum_{\Delta R=0.3} p_T(trk) - \sum_{\Delta R=0.05} p_T(trk)\right),
76     \end{equation}
77    
78     and non-normalized version of the above:
79    
80     \begin{equation}
81     \label{eq:trkIso}
82     Iso_{trk} = \sum_{\Delta R=0.3} p_T(trk) - \sum_{\Delta R=0.05} p_T(trk).
83     \end{equation}
84    
85 kkaadze 1.2 %We also test the track isolation discriminant defined within electroweak group:
86     %\begin{equation}
87     %\label{eq:trkIsoEWK}
88     %Iso_{trk}(EWK) = \sum_{\Delta R = 0.6} p_T(trk) - \sum_{\Delta R = 0.02} p_T(trk).
89     %\end{equation}
90 beaucero 1.1
91     %\subsubsection{ECAL and HCAL isolation requirements}
92     %We also consider a few ECAL
93     %
94     %In order to discriminate electron from em-jet isolation variable defined using both ECAL and HCAL is used.
95     %Jet deposit high fraction of its energy in HCAL and much less in ECAL. While electron deposits all its energy in ECAL
96     %with very small hadronic fraction, unless it is very energetic when longitudinal energy leakage appears.
97     %Isolation variable, defined as Eq.~\ref{eq:3}, is very similar by the content to the variable $H/E$,
98     %ratio of energy deposited HCAL behind the SC over the SC energy, which is used in official egamma POG
99     %electron identification criteria.
100    
101    
102     \subsection{Optimization method and strategy}
103 kkaadze 1.2 We start building the ``Simple Loose" electron criteria based on the discriminants utilized in the official
104 beaucero 1.1 ``Robust'' and adding more variables from the ``Loose" criteria (or their more powerful variants described
105     above), and tuning the threshold to keep the efficiency of each criterion to be
106 kkaadze 1.2 above 99\%. ``Simple Tight'' selection is developed for selection electrons coming from $W$ boson. In order
107     to keep the whole selection robust and avoid many different thresholds and reduce source of systematic
108     uncertainties we define ``Simple Tight'' based on ``Simple Loose'' with one addition discriminator.
109     %A similar optimization is done for the ``Simple Tight" requirements, although, the efficiency
110     %was not required to exceed 99\% for the a given criterion. Instead,
111     To pick up the threshold for the tight criterion we plot signal efficiency $v.s.$ background efficiency
112     and find a region in the plot that is closest to the ``perfect" performance corner that has 100\% signal
113     and 0\% background efficiencies.
114 beaucero 1.1
115     We study the performance of the requirements by applying them in sequential order, starting
116     with the most simple and robust, and continuing to more complex
117     ones. We also study the correlation between variables by changing the order they are
118     applied to see if some of the variables are completely correlated with the others and can
119     be omitted.
120 kkaadze 1.2 Electrons reconstructed in barrel and endcap of electronmagnetic calorimeter are considered seperately.
121     WZ signal samples are used as a source of real electrons and multijet samples from ``Gumbo soup'' are used as
122     a source of em-jets.
123    
124     \subsection{Tuning ``Simple Loose" criteria}
125     Tunning of ``WZ Loose'' critaria is described below.
126     As mentioned above, we start from the variables used within ``Robust'' selection and studying their performance
127     on the our samples and optimizing thresholds. Spatial track matching variables, $\Delta\eta$ and $\Delta\phi$,
128     are used at the first dictriminants. The disctribution of these variables are shown on figures ~\ref{fig:DeltaEta}
129     and ~\ref{fig:DeltaPhi} respectively.
130    
131 beaucero 1.3 \begin{figure}[!tb]
132 kkaadze 1.2 \begin{center}
133     \includegraphics[width=16cm]{Figs/deltaEta.eps}
134     \vspace{-10mm}
135     \caption{Upper plots represent the distribution of $\Delta\eta$ for electrons and mis-identified jets in Barrel(left) and Endcap(right). Lower plots represent background efficiency $v.s.$ signal efficiency for given criterion in Barrel(left) and Endcap(right).
136     \label{fig:DeltaEta}}
137     \end{center}
138     \end{figure}
139    
140    
141 beaucero 1.3 \begin{figure}[!tb]
142 kkaadze 1.2 \begin{center}
143     \includegraphics[width=16cm]{Figs/deltaPhi.eps}
144     \vspace{-10mm}
145     \caption{Upper plots represent the distribution of $\Delta\phi$ for electrons and mis-identified jets in Barrel(left) and Endcap(right). Lower plots represent background efficiency $v.s.$ signal efficiency for given criteron in Barrel(left) and Endcap(right).
146     \label{fig:DeltaPhi}}
147     \end{center}
148     \end{figure}
149    
150    
151     Third disctriminating variable included in ``Simple Loose'' selection is based on shower shape in ECAL. We stady
152     two ways of parametrization described in section 4.1.2 and checked their power on rejecting em-jets lookeing
153     at background efficinecy $v.s.$ signal efficinecy.This comparison is presented figure ~\ref{fig:SigmaEE_vs_EtaWidth}.
154    
155 beaucero 1.3 \begin{figure}[!tb]
156 kkaadze 1.2 \begin{center}
157     \includegraphics[width=16cm]{Figs/Eff_EtaWidth_SigmaEtaEta.eps}
158     \vspace{-10mm}
159     \caption{Plot on the left represents background efficiency $v.s.$ signal efficiency in Barrel for $\eta$ width of SC (red) and $\sigma_{\eta\eta}$ (blue). Plot on the right represents the same for Endcap.
160     \label{fig:SigmaEE_vs_EtaWidth}}
161     \end{center}
162     \end{figure}
163    
164     As it can be seen $\sigma_{\eta\eta}$ variable is more powerfull in Barrel allowing to reject \~ 70\% of em-jet with only couple of \% loss of signal, while in endcap the performance of these two variables are more comparable. This result leads us to keep $\sigma_{\eta\eta}$ in our selection. Distribution of $\sigma_{\eta\eta}$ is presented on figure ~\ref{fig:SigmaEtaEta}.
165    
166 beaucero 1.3 \begin{figure}[!tb]
167 kkaadze 1.2 \begin{center}
168     \includegraphics[width=16cm]{Figs/Eff_EtaWidth_SigmaEtaEta.eps}
169     \vspace{-10mm}
170     \caption{Upper plots represent the distribution of $\sigma_{\eta\eta}$ for electrons and mis-identified jets in Barrel(left) and Endcap(right). Lower plots represent background efficiency $v.s.$ signal efficiency for this discriminator in Barrel(left) and Endcap(right).
171     \label{fig:SigmaEtaEta}}
172     \end{center}
173     \end{figure}
174    
175    
176     These three discriminants are the same as used in official ``Robust'', ``Loose'' or ``Tight'' selections.
177     For utilizing matching of energy/momentum from ECAL and tracker two variables are studied: $E_{seed}/p_{out}$ and $E/p$,
178     where the latter one is used in official ``Loose'' and ``Tight'' selection to categorize electons on $E/p$ $v.s.$ $fbrem$
179     plane. Here $fbrem$ is defined as $(p_{in}-p_{out})/p_{in}$, where $p_{in}$ and $p_{out}$ are momenta measured at the
180     inner and outer edge of tracker respectively. Comparison the signal $v.s.$ background efficiency for these two options
181     is shown in figure ~\ref{fig:EseedOPout_vs_EoP}.
182    
183    
184 beaucero 1.3 \begin{figure}[!tb]
185 kkaadze 1.2 \begin{center}
186     \includegraphics[width=16cm]{Figs/Eff_EoP_EseedoPout.eps}
187     \vspace{-10mm}
188     \caption{Plot on the left represents background efficiency $v.s.$ signal efficiency in Barrel for $E/p$ (red) and $E_{seed}/p_{out}$ (blue). Plot on the right represents the same for Endcap.
189     \label{fig:EseedOPout_vs_EoP}}
190     \end{center}
191     \end{figure}
192    
193    
194     $E_{seed}/p_{out}$ is included as another criterion in ``Simple Loose'' selection. Its distribution both in barrel
195     and endcap is shown on figure ~\ref{fig:EseedOPout}
196    
197 beaucero 1.3 \begin{figure}[!tb]
198 kkaadze 1.2 \begin{center}
199     \includegraphics[width=16cm]{Figs/EseedPout.eps}
200     \vspace{-10mm}
201     \caption{Upper plots represent the distribution of $E_{seed}/p_{out}$ for electrons and mis-identified jets in Barrel(left) and Endcap(right). Lower plots represent background efficiency $v.s.$ signal efficiency for this discriminator in Barrel(left) and Endcap(right).
202     \label{fig:EseedOPout}}
203     \end{center}
204     \end{figure}
205    
206    
207     In order to select isolated electron candidates we use track isolation. As mentioned in section 4.1.5 we
208     study both normalized and non-normalized definition of it. We compare which one of the two definitions allow
209     us better identify real electrons from em-jets, figure ~\ref{fig:TrkIso_no_vs_norm}
210    
211    
212 beaucero 1.3 \begin{figure}[!tb]
213 kkaadze 1.2 \begin{center}
214     \includegraphics[width=16cm]{Figs/Eff_trkISoNonNorm_trkIsoNorm.eps}
215     \vspace{-10mm}
216     \caption{Plot on the left represents background efficiency $v.s.$ signal efficiency in Barrel for non-normalized (red) and normalized (blue) definition of track isolation. Plot on the right represents the same for Endcap.
217     \label{fig:TrkIso_no_vs_norm}}
218     \end{center}
219     \end{figure}
220    
221    
222     We keep isolation defined as ~\ref{eq:trkIsoN} in ``Simple Loose'' selection.
223    
224    
225     \subsection{Tuning ``Simple Tight" criteria}
226     As mentioned in section 4.2 we stydy another isolation variable, using the information from ECAL and HCAL, in order
227     to reject higher fraction of em-jets which can easily mimic electron from $W$ boson decay. The comparison is done
228     between $H/E$ and $Iso_{EmHad}$ ($Sec 4.1.4$) variables and their discrimination power. First obsetvarion is that there
229     is clear correlation between them, once $Iso_{EmHad}$ discriminator is applied to select elctrons the efficiency of
230     $H/E$ variable to reject em-jets is hardly a few \%. Also the study has shown that isoaltion variable is more effecient
231     in selecting real electrons than $H/E$. Thus we keep $Iso_{EmHad}$ as the additional criterion for ``Simple Tight''
232     selection. Its istribution is presented on figure ~\ref{fig:EmHadIso}
233    
234 beaucero 1.3 \begin{figure}[!tb]
235 kkaadze 1.2 \begin{center}
236     \includegraphics[width=16cm]{Figs/EmHadIso.eps}
237     \vspace{-10mm}
238     \caption{Upper plots represent the distribution of $Iso_{EmHad}$ for electrons and mis-identified jets in Barrel(left) and Endcap(right). Lower plots represent background efficiency $v.s.$ signal efficiency for this discriminator in Barrel(left) and Endcap(right).
239     \label{fig:EmHadIso}}
240     \end{center}
241     \end{figure}
242    
243    
244    
245     %TABLE
246    
247     The table below table~\ref{tab:OurSelection} contains the threshold values and their corresponding efficiencies
248     on signal and background for barrel and endcap. We select electrons if its discriminators are less than
249     corresponding thresholds for all the cases except the criterion $E_{seed}/p_{out}$, when we require it to be more than
250     threshold.
251    
252    
253    
254     %\begin{table}[htb]
255     % \caption{Thresholds and efficinecy of the criteria.}
256     % \label{tab:DeltaEtaDeltaPhi}
257     % \begin{center}
258     % \begin{tabular}{|c|c|c|c|} \hline
259     % & Barrel & Endcap \\ \hline
260     % & $Thr$ & $Eff_{sig}$ & $Bkg_{bkg}$ & $Thr$ & $Eff_{sig}$ & $Bkg_{bkg}$ \\ \hline
261     %$\Delta\eta$ & 0.009 & ----- & ----- & 0.007 & ---- & ---- & \\ \hline
262     %$\Delta\phi$ & 0.005 & ----- & ----- & 0.005 & ---- & ---- & \\ \hline
263     % \end{tabular}
264     % \end{center}
265     % \end{table}
266    
267    
268    
269 beaucero 1.1
270    
271