ViewVC Help
View File | Revision Log | Show Annotations | Root Listing
root/cvsroot/UserCode/Vuko/Notes/ElectronID/Variables.tex
Revision: 1.4
Committed: Fri Jun 20 17:26:18 2008 UTC (16 years, 10 months ago) by ymaravin
Content type: application/x-tex
Branch: MAIN
Changes since 1.3: +211 -117 lines
Log Message:
YM: added Appendix for standard official ID, modified extensively the text, changed the names of eps files to make them compatible with macosx

File Contents

# Content
1 Electron is identified by combining information from the CMS
2 tracker and ECAL. Our initial (preselection) requirement is that
3 the electron candidate in addition to the kinematics criteria
4 described in the Section above, is also identified as a GSF electron,
5 {\it i.e.}, it has a GSF track matched with the ECAL Super Cluster (SC).
6 The efficiency of the selection criteria is measured with respect
7 to these initial requirements.
8
9 The focus of this note is to optimize official loose and tight identification
10 criteria defined in Appendix~\ref{a:OfficialCriteria}
11 to identify electrons from $Z$ and $W$ decays, respectively.
12 This can be achieved by optimizing the thresholds, optimizing the
13 discriminants that have more background rejection power, and selecting
14 the variables that are not highly correlated with each other.
15 The latter allows keeping the number of variables in the criteria to a
16 minimum, which results in a simpler set of requirements with smaller
17 systematic uncertainties, and thus more robust in the startup conditions.
18
19 Variables that allow discriminating electrons from em-jets can be roughly
20 divided into two classes: matching/shower-shape and isolation
21 discriminants. In the following we treat these two classes separately to
22 follow the existing egamma POG identification scheme. The variables are
23 described in the next two subsections.
24
25 \subsection{Identification variables}
26 \label{ss:matching}
27 \subsubsection{Track-ECAL matching}
28 An electron GSF track should be well-matched to the ECAL Super Cluster (SC), while
29 a $\pi^0$ energy deposit in ECAL is not necessarily matched well with a GSF track, which
30 is usually produced by a charged pion. Thus, a spatial match between a track and a SC can be a good
31 discriminant. This match is described in azimuthal and pseudorapidity planes and denoted as
32 $\Delta\phi$ and $\Delta\eta$, respectively.
33
34 \subsubsection{ECAL energy width}
35 \label{ss:etawidth}
36 An electron brems extensively in CMS tracker, which results in a rather wide shower in azimuthal
37 plane due to a strong CMS magnetic field. However, the width of a shower in pseudorapidity
38 plane remains very narrow and can discriminate against jets, which tend to create rather large
39 clusters in both $\eta$ and $\phi$ directions. We consider two parameterization of the shower width:
40 the one, used in official electron identification developed by egamma POG, $\sigma_{\eta\eta} = \sqrt{CovEtaEta}$ which describes the width of the highest-energy basic cluster of the SC, further referred
41 to as a seed cluster, and an energy-weighted width of SC in $\eta$ direction, defined as
42 \begin{equation}
43 \label{eq:etawidth}
44 \sigma_\eta = \frac{1}{E_{SC}} \sqrt{\sum_{ECAL~SC~RecHits} E_i(\eta_i - \eta_{SC})^2}.
45 \end{equation}
46
47 \subsubsection{E/p-based variables}
48 \label{ss:ep}
49 Electron should deposit all of its energy in the ECAL detector, thus the track momentum
50 at the outer edge of tracker $p_{out}$ should be of the same order as an energy of the seed EM
51 cluster $E_{seed}$. Jet is reconstructed combining information from CMS tracker and hadronic calomitere (HCAL). It interact with ECAL much less intensively and most of its energy, carried after tracker, is deposited in HCAL. Hence, $p_{out}$ and $E_{seed}$ differ significantly for the jets and can be used to discriminate them from electrons.
52 We also considered an official version of this discriminant $E_{SC}/p_{in}$, where $E_{SC}$ is a SC energy, and $p_{in}$ is the initial momentum of a charged particle.
53
54 \subsubsection{H/E variables}
55 One can form a powerful discriminant by using the HCAL and ECAL energies associated
56 with an electron candidate, as electrons tend to deposit very little or no energy in HCAL,
57 while jets produce a wide energy deposition in the HCAL. A variable used in official
58 ``Robust" as well as ``Loose" and ``Tight" criteria is the ratio of HCAL and ECAL energies: $H/E$.
59 It peaks around 0 for electrons and can be quite large for em-jets. We form a variable that also
60 takes into account the width of the ECAL energy deposition by making a ratio of the energy deposited
61 in HCAL and ECAL in the cone of $\Delta R < 0.3$ which is not included in the SC, normalized
62 to the SC energy as follows
63
64 \begin{equation}
65 \label{eq:emhad}
66 EmHad =\frac{1}{E_{SC}}\left(\sum_{\Delta R=0.3}E^{ecal}_{RecHit} +
67 \sum_{\Delta R=0.3}E^{hcal}_{RecHit} - E_{SC}\right).
68 \end{equation}
69
70 \subsubsection{Track isolation requirements}
71 \label{ss:trackIso}
72 As electrons from $W$ and $Z$ boson decays are isolated, requiring electron isolated from tracking
73 activity can significantly suppress em-jets that usually have a large number of soft tracks around the
74 leading $\pi^0$. It also can suppress real electrons from semi-leptonic decays of $b$ quarks which
75 tend to be non-isolated as well. We consider several versions of the track isolation requirements:
76
77 \begin{equation}
78 \label{eq:trkIsoN}
79 Iso^{trk}_{Norm.} = \frac{1}{p_T(e)}\left(\sum_{\Delta R=0.3} p_T(trk) - \sum_{\Delta R=0.05} p_T(trk)\right),
80 \end{equation}
81
82 and non-normalized version of the above:
83
84 \begin{equation}
85 \label{eq:trkIso}
86 Iso^{trk} = \sum_{\Delta R=0.3} p_T(trk) - \sum_{\Delta R=0.05} p_T(trk).
87 \end{equation}
88
89 We also test the track isolation discriminant with the larger outer cone, similar to that defined by electroweak group:~\footnote{Our analysis trees do not store information that would allow us to test
90 the isolation variable with an annulus of $0.02 < \Delta R < 0.6$. However, the performance
91 of a track isolation in an annulus of $0.05 < \Delta R < 0.5$ is similar.}
92 \begin{equation}
93 \label{eq:trkIsoEWK}
94 Iso^{trk}_{EWK} = \sum_{\Delta R = 0.6} p_T(trk) - \sum_{\Delta R = 0.02} p_T(trk).
95 \end{equation}
96
97
98
99
100 \subsection{Optimization method and strategy}
101 The focus of this work is to define two simple selection criteria: ``Simple Loose" and ``Simple Tight"
102 to select electrons from $Z \to ee$ and $W \to e\nu_e$ processes, respectively. We start building
103 the ``Simple Loose" electron criteria based on the discriminants utilized in the official ``Robust''
104 and adding more variables from the ``Loose" criteria (or their more powerful equivalents described
105 above), and tuning the thresholds to keep the efficiency of each criterion high. We change the order
106 of how the requirements are applied to assess the level of correlation between different criteria.
107 We try to retain only the variables that are as less correlated as possible to keep the number
108 of criteria to a minimum.
109
110 We design the ``Simple Tight'' requirements by applying additional criteria on top of the
111 unchanged ``Simple Loose''. We chose not to modify thresholds of ``Simple Loose'' criteria
112 to reduce possible systematic uncertainties and to make the process of extracting the
113 efficiency from data easier.
114
115 Both ``Simple Tight'' and ``Simple Loose'' criteria are designed separately for ECAL
116 barrel and endcap. We consider electron candidates with $p_T > 10$ GeV. The signal
117 electron sample was obtained from the $WZ$ Monte Carlo simulation, and the misidentified
118 jets are obtained from the "Gumbo" soup where we exclude $\gamma+X$ processes.
119
120 \subsection{Tuning the ``Simple Loose" criteria}
121 The first two criteria of ``Robust'' requirements are $\Delta \eta$ and $\Delta \phi$.
122 The distributions of these variables for signal and background together with the
123 em-jet separation power are given in Figs.~\ref{fig:DeltaEta}~and~\ref{fig:DeltaPhi}.
124 We apply both requirements with the thresholds of 0.009(0.007) and 0.05(0.05) for
125 $\Delta \eta$ and $\Delta \phi$ for barrel(endcap), respectively.
126
127 \begin{figure}[!tb]
128 \begin{center}
129 \includegraphics[width=16cm]{Figs/deltaEta.eps}
130 \vspace{-10mm}
131 \caption{Distributions of $\Delta \eta$ for electrons (blue solid line) and misidentified jets
132 (red dashed line) in barrel (top left) and endcap (top right).
133 The performance of these discriminants in terms of efficiencies to select
134 true electrons and misidentified jets are given below.
135 \label{fig:DeltaEta}}
136 \end{center}
137 \end{figure}
138
139
140 \begin{figure}[!tb]
141 \begin{center}
142 \includegraphics[width=16cm]{Figs/deltaPhi.eps}
143 \vspace{-10mm}
144 \caption{The same as Fig.~\ref{fig:DeltaEta}, but for $\Delta \phi$.
145 \label{fig:DeltaPhi}}
146 \end{center}
147 \end{figure}
148
149 The third discriminating variable included in the ``Robust'' criteria describes the width
150 of the EM shower. We study two different parameterizations of this variable
151 described in~\ref{ss:etawidth}. The performance of their discriminating power is given in
152 Fig.~\ref{fig:SigmaEE_vs_EtaWidth}.
153
154 \begin{figure}[!tb]
155 \begin{center}
156 \includegraphics[width=16cm]{Figs/Eff_EtaWidth_SigmaEtaEta.eps}
157 \vspace{-10mm}
158 \caption{The background efficiency $v.s.$ signal efficiency in barrel (left) and endcap (right)
159 for $\eta$ width of SC (red) and $\sigma_{\eta\eta}$ (blue).
160 \label{fig:SigmaEE_vs_EtaWidth}}
161 \end{center}
162 \end{figure}
163
164 As it can be seen, the $\sigma_{\eta\eta}$ variable is more powerful in barrel that can reject $\sim 70\%$
165 of em-jets while loosing only a couple of percents of signal. The performance in endcap is comparable
166 between the two. As $\sigma_{\eta\eta}$ is shown to be described very well by Monte
167 Carlo~\footnote{$\sigma_{\eta\eta}$ was studied using test beam data and was shown to be described
168 in simulation very well.} we chose to select $\sigma_{\eta\eta} < 0.012 (0.026)$ for barrel (endcap).
169 The plot with distributions of $\sigma_{\eta\eta}$ and the performance in barrel and endcap is given
170 in Fig.~\ref{fig:SigmaEtaEta}.
171 \begin{figure}[!tb]
172 \begin{center}
173 \includegraphics[width=16cm]{Figs/sigmaEtaEta.eps}
174 \vspace{-10mm}
175 \caption{
176 Distributions of $\sigma_{\eta\eta}$ for electrons (blue solid line) and misidentified jets
177 (red dashed line) in barrel (top left) and endcap (top right).
178 The performance of these discriminants in terms of efficiencies to select
179 true electrons and misidentified jets are given below.
180 \label{fig:SigmaEtaEta}}
181 \end{center}
182 \end{figure}
183
184 The discriminants that we chose so far are used in official ``Robust'' criteria.
185 The last requirement from the official criteria is $H/E$ which we chose to keep
186 unchanged from the predefined GSF electron requirement ($H/E < 0.2$).
187 It will be shown that $H/E$ variable is highly correlated with isolation requirements
188 that we will apply later, and thus it does not need to be tightened.
189
190 As all the discriminants from the ``Robust'' selection are considered, we start including
191 the variables utilized in the official ``Loose'' requirement. At first, we consider
192 $E_{seed}/p_{out}$ and $E/p$, described in Section~\ref{ss:ep}.
193 The latter is used in the official ``Loose'' and ``Tight'' selections to categorize
194 electrons in $E/p$ $v.s.$ $fbrem$ phase space, where $fbrem$ is defined as
195 \begin{equation}
196 \label{eq:fbrem}
197 fbrem = \frac{p_{in} - p_{out}}{p_{in}},
198 \end{equation}
199 where $p_{in}$ and $p_{out}$ are momenta measured at the inner and outer edge of
200 the tracker, respectively. The performance of the $E_{seed}/p_{out}$ and $E/p$
201 are given in Fig.~\ref{fig:EseedOPout_vs_EoP}.
202
203 \begin{figure}[!tb]
204 \begin{center}
205 \includegraphics[width=16cm]{Figs/Eff_EoP_EseedoPout.eps}
206 \vspace{-10mm}
207 \caption{Background $v.s.$ signal efficiency for barrel (left) and endcap (right) for $E/p$ (red)
208 and $E_{seed}/p_{out}$ (blue).
209 \label{fig:EseedOPout_vs_EoP}}
210 \end{center}
211 \end{figure}
212
213 $E_{seed}/p_{out}$ has a better performance and we include it to the ``Simple Loose''
214 requirement to be more than 0.9 for both barrel and endcap. The distributions of this
215 variable and discriminating performance is given in Fig.~\ref{fig:EseedOPout}.
216
217 \begin{figure}[!tb]
218 \begin{center}
219 \includegraphics[width=16cm]{Figs/EseedPout.eps}
220 \vspace{-10mm}
221 \caption{$E_{seed}/p_{out}$ for electrons(blue solid line) and mis-identified jets (red dashed line)
222 in barrel (left top) and endcap (right top). The performance of the criteria for different thresholds
223 is given in the bottom left plot for barrel, and bottom right plot for endcap.
224 \label{fig:EseedOPout}}
225 \end{center}
226 \end{figure}
227
228 Up to this point we utilized all of the variables of the ``Robust'' and ``Loose'' criteria with an exception
229 of a much looser $H/E$ requirement and substitution of $E/p$ with $E_{seed}/p_{out}$ variable.
230 As it is shown later in the note, the performance of this criteria with respect to the official ``Loose'' is
231 very comparable without complexity due to the categorization employed by the ``Loose'' criteria.
232 The improvement is mostly achieved by utilizing a more discriminating variables and tuned
233 thresholds.
234
235 It is possible to further improve the performance of the ``Simple Loose'' identification by applying
236 track isolation requirements. We study the performance of three various track isolation requirements
237 described in Section~\ref{ss:trackIso} where we varied the cone sizes from 0.5 to 0.05 and select
238 the most optimal values. The results for the variables with optimized cone sizes are given
239 in Fig.~\ref{fig:TrkIso_no_vs_norm}.
240
241 \begin{figure}[!tb]
242 \begin{center}
243 \includegraphics[width=16cm]{Figs/Eff_trkISoNonNorm_trkIsoNorm_trkIsoEWK.eps}
244 \vspace{-10mm}
245 \caption{Performance of three track isolation variables for barrel (left) and endcap (right).
246 \label{fig:TrkIso_no_vs_norm}}
247 \end{center}
248 \end{figure}
249
250 The normalized track isolation variable, $Iso^{trk}_{Norm}$, offers the best performance,
251 with an expense of making the efficiency to depend on the electron candidate $p_T$.
252 As it is described in the Section~\ref{Results}, the requirement results in a loss of efficiency
253 at low $p_T$; however, the loss is small in the characteristic range of $p_T$ of electrons
254 from $W$ and $Z$ boson decays.
255
256 By introducing the isolation requirement, we also reject a significant background from
257 non-isolated electrons from $Zb\bar{b}$ production, where $b$ quarks decay semileptonically.
258 The distribution of $Iso^{trk}_{Norm}$ for signal electrons and misidentified
259 light- and $b$-quark jets are given in Fig.~\ref{fig:TrackIsoChowd}
260
261 \begin{figure}[!tb]
262 \begin{center}
263 \includegraphics[width=16cm]{Figs/TrackIsoChowd.eps}
264 \vspace{-10mm}
265 \caption{The distribution of the normalized track isolation defined in Eq.~\ref{eq:trkIsoN} for signal
266 electrons (blue solid line), misidentified light-quark jets (red squares), and $b$-quark jets (black inverted
267 triangles) in barrel (left) and endcap (right).
268 \label{fig:TrackIsoChowd}
269 }
270 \end{center}
271 \end{figure}
272
273 We illustrate the performance of track isolation variable chosen for ``Simple Loose'' criteria
274 in Fig.~\ref{fig:trkIso}, and we set the requirement of $Iso^{trk}_{Norm}$ to be less than
275 0.1 (0.2) for barrel(endcap).
276 \begin{figure}[!tb]
277 \begin{center}
278 \includegraphics[width=16cm]{Figs/trkIso.eps}
279 \vspace{-10mm}
280 \caption{The $Iso^{trk}_{Norm}$ discriminant for electrons (blue soild line) and misidentified jets (red dashed line) for barrel (top left) and endcap (top right). The signal $v.s.$ background efficiencies
281 obtained by varying the threshold are displayed in bottom left (barrel) and bottom right (endcap) plots.
282 \label{fig:trkIso}
283 }
284 \end{center}
285 \end{figure}
286
287 %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
288
289 \subsection{Tuning the ``Simple Tight" criteria}
290 The ``Simple Loose'' criteria utilize variables that describe spatial- and energy-matching of a
291 track to a SC, lateral energy deposition profile of a SC, and a track isolation requirement.
292 Although, these requirements are sufficient to select a relatively clean sample of $Z\to ee$ events,
293 the discriminating power is insufficient to suppress the background from misidentified jets
294 from the $Z+jets$ processes. Therefore, we introduce a ``Simple Tight'' criteria that is based
295 on the ``Simple Loose'' one with an addition of the variable that takes into account the energy
296 deposition in the HCAL.
297
298 The variable that is used by the official criteria is $H/E$. We also study $EmHad$ variable
299 defined in Eq.~\ref{eq:emhad} which is a mix of $H/E$ and an HCAL+ECAL isolation in a
300 relatively narrow cone of $\Delta R < 0.3$.~\footnote{Similarly to the track isolation variable,
301 we varied the cone size to select the most optimal variable.}
302 As expected, these two variables are found to be
303 correlated, although the $EmHad$ has much stronger discriminating power
304 (see Fig.~\ref{fig:Eff_hoE_IsoEmHad}, due to an additional requirement on the lateral energy
305 profile in the 0.3 cone.
306
307 \begin{figure}[!tb]
308 \begin{center}
309 \includegraphics[width=16cm]{Figs/Eff_hoE_IsoEmHad.eps}
310 \vspace{-10mm}
311 \caption{$H/E$ and $EmHad$ variables for barrel (left) and endcap (right).
312 \label{fig:Eff_hoE_IsoEmHad}}
313 \end{center}
314 \end{figure}
315
316 We chose to apply the $EmHad < 0.18 (0.1)$ requirement for barrel (endcap)
317 to the ``Simple Loose'' set (see Fig.~\ref{fig:EmHadIso}). The obtained selection
318 criteria is thus referred to as to ``Simple Tight''. Applying other ECAL or HCAL isolation
319 criteria on top of ``Simple Tight'' does not result in any substantial improvement
320 in performance. Thus, we believe that $EmHad$ and tracker isolation are sufficient
321 to identify the isolated electrons in the $p_T$ range of those from $W$ and $Z$ boson decays.
322
323 \begin{figure}[!tb]
324 \begin{center}
325 \includegraphics[width=16cm]{Figs/EmHadIso.eps}
326 \vspace{-10mm}
327 \caption{$EmHad$ for electrons (solid blue line) and mis-identified jets (red dashed line)
328 in barrel (top left) and endcap (top right). The performance for the discriminator is given below
329 for barrel (bottom left) and endcap (bottom right).
330 \label{fig:EmHadIso}}
331 \end{center}
332 \end{figure}
333
334
335 \subsection{Summary of definitions of the ``Simple Loose'' and ``Simple Tight''}
336
337 To summarize the results, the ``Simple Loose'' criteria is a simple cut-based criteria
338 that utilize three discriminants from the official ``Robust'' ($\Delta\eta$, $\Delta\phi$, and
339 $\sigma_{\eta\eta}$), a modified variable from the ``Loose'' (a replacement of $E/p$ with
340 $E_{seed}/p_{out}$) and an additional track isolation requirement. The thresholds
341 have been also optimized to retain high signal efficiency.
342
343 The ``Simple Tight'' requirement is ``Simple Loose'' with an addition of an effective
344 $H/E$ and ECAL+HCAL isolation requirement.
345
346 We summarize the thresholds for barrel and endcap in Table~\ref{tab:OurSelection}
347 and proceed with detailed analysis of the performance in the next Section.
348
349
350 \begin{table}[htb]
351 \caption{Thresholds and efficinecy of the criteria.}
352 \label{tab:OurSelection}
353 \begin{center}
354 \begin{tabular}{|c|c|c|} \hline
355 & Barrel threshold & Endcap \\ \hline
356 $\Delta\eta < $ & 0.009 & 0.007 \\
357 $\Delta\phi < $ & 0.005 & 0.005 \\
358 $\sigma_{\eta\eta} < $ & 0.012 & 0.026 \\
359 $E_{seed}/p_{out} > $ & 0.9 & 0.9 \\
360 $Iso^{trk}_{Norm} < $ & 0.1 & 0.2 \\
361 $EmHad < $ & 0.18 & 0.1 \\ \hline
362 \end{tabular}
363 \end{center}
364 \end{table}
365