ViewVC Help
View File | Revision Log | Show Annotations | Root Listing
root/cvsroot/UserCode/Vuko/Notes/ElectronID/Variables.tex
Revision: 1.5
Committed: Fri Jun 20 17:50:33 2008 UTC (16 years, 10 months ago) by kkaadze
Content type: application/x-tex
Branch: MAIN
CVS Tags: HEAD
Changes since 1.4: +5 -4 lines
Log Message:
ToCompile

File Contents

# User Rev Content
1 beaucero 1.1 Electron is identified by combining information from the CMS
2     tracker and ECAL. Our initial (preselection) requirement is that
3     the electron candidate in addition to the kinematics criteria
4     described in the Section above, is also identified as a GSF electron,
5     {\it i.e.}, it has a GSF track matched with the ECAL Super Cluster (SC).
6     The efficiency of the selection criteria is measured with respect
7     to these initial requirements.
8    
9     The focus of this note is to optimize official loose and tight identification
10 ymaravin 1.4 criteria defined in Appendix~\ref{a:OfficialCriteria}
11     to identify electrons from $Z$ and $W$ decays, respectively.
12 beaucero 1.1 This can be achieved by optimizing the thresholds, optimizing the
13     discriminants that have more background rejection power, and selecting
14     the variables that are not highly correlated with each other.
15     The latter allows keeping the number of variables in the criteria to a
16     minimum, which results in a simpler set of requirements with smaller
17     systematic uncertainties, and thus more robust in the startup conditions.
18    
19     Variables that allow discriminating electrons from em-jets can be roughly
20     divided into two classes: matching/shower-shape and isolation
21     discriminants. In the following we treat these two classes separately to
22     follow the existing egamma POG identification scheme. The variables are
23     described in the next two subsections.
24    
25     \subsection{Identification variables}
26     \label{ss:matching}
27     \subsubsection{Track-ECAL matching}
28     An electron GSF track should be well-matched to the ECAL Super Cluster (SC), while
29     a $\pi^0$ energy deposit in ECAL is not necessarily matched well with a GSF track, which
30 ymaravin 1.4 is usually produced by a charged pion. Thus, a spatial match between a track and a SC can be a good
31 beaucero 1.1 discriminant. This match is described in azimuthal and pseudorapidity planes and denoted as
32 kkaadze 1.2 $\Delta\phi$ and $\Delta\eta$, respectively.
33 beaucero 1.1
34     \subsubsection{ECAL energy width}
35 ymaravin 1.4 \label{ss:etawidth}
36 beaucero 1.1 An electron brems extensively in CMS tracker, which results in a rather wide shower in azimuthal
37     plane due to a strong CMS magnetic field. However, the width of a shower in pseudorapidity
38 kkaadze 1.2 plane remains very narrow and can discriminate against jets, which tend to create rather large
39     clusters in both $\eta$ and $\phi$ directions. We consider two parameterization of the shower width:
40     the one, used in official electron identification developed by egamma POG, $\sigma_{\eta\eta} = \sqrt{CovEtaEta}$ which describes the width of the highest-energy basic cluster of the SC, further referred
41     to as a seed cluster, and an energy-weighted width of SC in $\eta$ direction, defined as
42 beaucero 1.1 \begin{equation}
43     \label{eq:etawidth}
44     \sigma_\eta = \frac{1}{E_{SC}} \sqrt{\sum_{ECAL~SC~RecHits} E_i(\eta_i - \eta_{SC})^2}.
45     \end{equation}
46    
47 ymaravin 1.4 \subsubsection{E/p-based variables}
48     \label{ss:ep}
49     Electron should deposit all of its energy in the ECAL detector, thus the track momentum
50     at the outer edge of tracker $p_{out}$ should be of the same order as an energy of the seed EM
51     cluster $E_{seed}$. Jet is reconstructed combining information from CMS tracker and hadronic calomitere (HCAL). It interact with ECAL much less intensively and most of its energy, carried after tracker, is deposited in HCAL. Hence, $p_{out}$ and $E_{seed}$ differ significantly for the jets and can be used to discriminate them from electrons.
52     We also considered an official version of this discriminant $E_{SC}/p_{in}$, where $E_{SC}$ is a SC energy, and $p_{in}$ is the initial momentum of a charged particle.
53    
54 beaucero 1.1 \subsubsection{H/E variables}
55     One can form a powerful discriminant by using the HCAL and ECAL energies associated
56     with an electron candidate, as electrons tend to deposit very little or no energy in HCAL,
57 ymaravin 1.4 while jets produce a wide energy deposition in the HCAL. A variable used in official
58 kkaadze 1.2 ``Robust" as well as ``Loose" and ``Tight" criteria is the ratio of HCAL and ECAL energies: $H/E$.
59     It peaks around 0 for electrons and can be quite large for em-jets. We form a variable that also
60 beaucero 1.3 takes into account the width of the ECAL energy deposition by making a ratio of the energy deposited
61 beaucero 1.1 in HCAL and ECAL in the cone of $\Delta R < 0.3$ which is not included in the SC, normalized
62     to the SC energy as follows
63    
64     \begin{equation}
65     \label{eq:emhad}
66 ymaravin 1.4 EmHad =\frac{1}{E_{SC}}\left(\sum_{\Delta R=0.3}E^{ecal}_{RecHit} +
67 beaucero 1.1 \sum_{\Delta R=0.3}E^{hcal}_{RecHit} - E_{SC}\right).
68     \end{equation}
69    
70     \subsubsection{Track isolation requirements}
71 ymaravin 1.4 \label{ss:trackIso}
72 beaucero 1.1 As electrons from $W$ and $Z$ boson decays are isolated, requiring electron isolated from tracking
73     activity can significantly suppress em-jets that usually have a large number of soft tracks around the
74     leading $\pi^0$. It also can suppress real electrons from semi-leptonic decays of $b$ quarks which
75 kkaadze 1.5 tend to be non-isolated as well. We consider several versions of the track isolation requirements
76     using CTF tracks around the electron candidates.
77 beaucero 1.1
78     \begin{equation}
79     \label{eq:trkIsoN}
80 ymaravin 1.4 Iso^{trk}_{Norm.} = \frac{1}{p_T(e)}\left(\sum_{\Delta R=0.3} p_T(trk) - \sum_{\Delta R=0.05} p_T(trk)\right),
81 beaucero 1.1 \end{equation}
82    
83     and non-normalized version of the above:
84    
85     \begin{equation}
86     \label{eq:trkIso}
87 ymaravin 1.4 Iso^{trk} = \sum_{\Delta R=0.3} p_T(trk) - \sum_{\Delta R=0.05} p_T(trk).
88     \end{equation}
89    
90     We also test the track isolation discriminant with the larger outer cone, similar to that defined by electroweak group:~\footnote{Our analysis trees do not store information that would allow us to test
91     the isolation variable with an annulus of $0.02 < \Delta R < 0.6$. However, the performance
92     of a track isolation in an annulus of $0.05 < \Delta R < 0.5$ is similar.}
93     \begin{equation}
94     \label{eq:trkIsoEWK}
95     Iso^{trk}_{EWK} = \sum_{\Delta R = 0.6} p_T(trk) - \sum_{\Delta R = 0.02} p_T(trk).
96 beaucero 1.1 \end{equation}
97    
98 ymaravin 1.4
99 beaucero 1.1
100    
101     \subsection{Optimization method and strategy}
102 ymaravin 1.4 The focus of this work is to define two simple selection criteria: ``Simple Loose" and ``Simple Tight"
103     to select electrons from $Z \to ee$ and $W \to e\nu_e$ processes, respectively. We start building
104     the ``Simple Loose" electron criteria based on the discriminants utilized in the official ``Robust''
105     and adding more variables from the ``Loose" criteria (or their more powerful equivalents described
106     above), and tuning the thresholds to keep the efficiency of each criterion high. We change the order
107     of how the requirements are applied to assess the level of correlation between different criteria.
108     We try to retain only the variables that are as less correlated as possible to keep the number
109     of criteria to a minimum.
110    
111     We design the ``Simple Tight'' requirements by applying additional criteria on top of the
112     unchanged ``Simple Loose''. We chose not to modify thresholds of ``Simple Loose'' criteria
113     to reduce possible systematic uncertainties and to make the process of extracting the
114     efficiency from data easier.
115    
116     Both ``Simple Tight'' and ``Simple Loose'' criteria are designed separately for ECAL
117     barrel and endcap. We consider electron candidates with $p_T > 10$ GeV. The signal
118     electron sample was obtained from the $WZ$ Monte Carlo simulation, and the misidentified
119     jets are obtained from the "Gumbo" soup where we exclude $\gamma+X$ processes.
120    
121     \subsection{Tuning the ``Simple Loose" criteria}
122     The first two criteria of ``Robust'' requirements are $\Delta \eta$ and $\Delta \phi$.
123     The distributions of these variables for signal and background together with the
124     em-jet separation power are given in Figs.~\ref{fig:DeltaEta}~and~\ref{fig:DeltaPhi}.
125     We apply both requirements with the thresholds of 0.009(0.007) and 0.05(0.05) for
126     $\Delta \eta$ and $\Delta \phi$ for barrel(endcap), respectively.
127 kkaadze 1.2
128 beaucero 1.3 \begin{figure}[!tb]
129 kkaadze 1.2 \begin{center}
130     \includegraphics[width=16cm]{Figs/deltaEta.eps}
131     \vspace{-10mm}
132 ymaravin 1.4 \caption{Distributions of $\Delta \eta$ for electrons (blue solid line) and misidentified jets
133     (red dashed line) in barrel (top left) and endcap (top right).
134     The performance of these discriminants in terms of efficiencies to select
135     true electrons and misidentified jets are given below.
136 kkaadze 1.2 \label{fig:DeltaEta}}
137     \end{center}
138     \end{figure}
139    
140    
141 beaucero 1.3 \begin{figure}[!tb]
142 kkaadze 1.2 \begin{center}
143     \includegraphics[width=16cm]{Figs/deltaPhi.eps}
144     \vspace{-10mm}
145 ymaravin 1.4 \caption{The same as Fig.~\ref{fig:DeltaEta}, but for $\Delta \phi$.
146 kkaadze 1.2 \label{fig:DeltaPhi}}
147     \end{center}
148     \end{figure}
149    
150 ymaravin 1.4 The third discriminating variable included in the ``Robust'' criteria describes the width
151     of the EM shower. We study two different parameterizations of this variable
152     described in~\ref{ss:etawidth}. The performance of their discriminating power is given in
153     Fig.~\ref{fig:SigmaEE_vs_EtaWidth}.
154 kkaadze 1.2
155 beaucero 1.3 \begin{figure}[!tb]
156 kkaadze 1.2 \begin{center}
157     \includegraphics[width=16cm]{Figs/Eff_EtaWidth_SigmaEtaEta.eps}
158     \vspace{-10mm}
159 ymaravin 1.4 \caption{The background efficiency $v.s.$ signal efficiency in barrel (left) and endcap (right)
160     for $\eta$ width of SC (red) and $\sigma_{\eta\eta}$ (blue).
161 kkaadze 1.2 \label{fig:SigmaEE_vs_EtaWidth}}
162     \end{center}
163     \end{figure}
164    
165 ymaravin 1.4 As it can be seen, the $\sigma_{\eta\eta}$ variable is more powerful in barrel that can reject $\sim 70\%$
166     of em-jets while loosing only a couple of percents of signal. The performance in endcap is comparable
167     between the two. As $\sigma_{\eta\eta}$ is shown to be described very well by Monte
168     Carlo~\footnote{$\sigma_{\eta\eta}$ was studied using test beam data and was shown to be described
169     in simulation very well.} we chose to select $\sigma_{\eta\eta} < 0.012 (0.026)$ for barrel (endcap).
170     The plot with distributions of $\sigma_{\eta\eta}$ and the performance in barrel and endcap is given
171     in Fig.~\ref{fig:SigmaEtaEta}.
172 beaucero 1.3 \begin{figure}[!tb]
173 kkaadze 1.2 \begin{center}
174 ymaravin 1.4 \includegraphics[width=16cm]{Figs/sigmaEtaEta.eps}
175 kkaadze 1.2 \vspace{-10mm}
176 ymaravin 1.4 \caption{
177     Distributions of $\sigma_{\eta\eta}$ for electrons (blue solid line) and misidentified jets
178     (red dashed line) in barrel (top left) and endcap (top right).
179     The performance of these discriminants in terms of efficiencies to select
180     true electrons and misidentified jets are given below.
181 kkaadze 1.2 \label{fig:SigmaEtaEta}}
182     \end{center}
183     \end{figure}
184    
185 ymaravin 1.4 The discriminants that we chose so far are used in official ``Robust'' criteria.
186     The last requirement from the official criteria is $H/E$ which we chose to keep
187     unchanged from the predefined GSF electron requirement ($H/E < 0.2$).
188     It will be shown that $H/E$ variable is highly correlated with isolation requirements
189     that we will apply later, and thus it does not need to be tightened.
190    
191     As all the discriminants from the ``Robust'' selection are considered, we start including
192     the variables utilized in the official ``Loose'' requirement. At first, we consider
193     $E_{seed}/p_{out}$ and $E/p$, described in Section~\ref{ss:ep}.
194     The latter is used in the official ``Loose'' and ``Tight'' selections to categorize
195     electrons in $E/p$ $v.s.$ $fbrem$ phase space, where $fbrem$ is defined as
196     \begin{equation}
197     \label{eq:fbrem}
198     fbrem = \frac{p_{in} - p_{out}}{p_{in}},
199     \end{equation}
200     where $p_{in}$ and $p_{out}$ are momenta measured at the inner and outer edge of
201     the tracker, respectively. The performance of the $E_{seed}/p_{out}$ and $E/p$
202     are given in Fig.~\ref{fig:EseedOPout_vs_EoP}.
203 kkaadze 1.2
204 beaucero 1.3 \begin{figure}[!tb]
205 kkaadze 1.2 \begin{center}
206     \includegraphics[width=16cm]{Figs/Eff_EoP_EseedoPout.eps}
207     \vspace{-10mm}
208 ymaravin 1.4 \caption{Background $v.s.$ signal efficiency for barrel (left) and endcap (right) for $E/p$ (red)
209     and $E_{seed}/p_{out}$ (blue).
210 kkaadze 1.2 \label{fig:EseedOPout_vs_EoP}}
211     \end{center}
212     \end{figure}
213    
214 ymaravin 1.4 $E_{seed}/p_{out}$ has a better performance and we include it to the ``Simple Loose''
215     requirement to be more than 0.9 for both barrel and endcap. The distributions of this
216     variable and discriminating performance is given in Fig.~\ref{fig:EseedOPout}.
217 kkaadze 1.2
218 beaucero 1.3 \begin{figure}[!tb]
219 kkaadze 1.2 \begin{center}
220     \includegraphics[width=16cm]{Figs/EseedPout.eps}
221     \vspace{-10mm}
222 ymaravin 1.4 \caption{$E_{seed}/p_{out}$ for electrons(blue solid line) and mis-identified jets (red dashed line)
223     in barrel (left top) and endcap (right top). The performance of the criteria for different thresholds
224     is given in the bottom left plot for barrel, and bottom right plot for endcap.
225 kkaadze 1.2 \label{fig:EseedOPout}}
226     \end{center}
227     \end{figure}
228    
229 ymaravin 1.4 Up to this point we utilized all of the variables of the ``Robust'' and ``Loose'' criteria with an exception
230     of a much looser $H/E$ requirement and substitution of $E/p$ with $E_{seed}/p_{out}$ variable.
231     As it is shown later in the note, the performance of this criteria with respect to the official ``Loose'' is
232     very comparable without complexity due to the categorization employed by the ``Loose'' criteria.
233     The improvement is mostly achieved by utilizing a more discriminating variables and tuned
234     thresholds.
235    
236     It is possible to further improve the performance of the ``Simple Loose'' identification by applying
237     track isolation requirements. We study the performance of three various track isolation requirements
238     described in Section~\ref{ss:trackIso} where we varied the cone sizes from 0.5 to 0.05 and select
239     the most optimal values. The results for the variables with optimized cone sizes are given
240     in Fig.~\ref{fig:TrkIso_no_vs_norm}.
241 kkaadze 1.2
242 beaucero 1.3 \begin{figure}[!tb]
243 kkaadze 1.2 \begin{center}
244 ymaravin 1.4 \includegraphics[width=16cm]{Figs/Eff_trkISoNonNorm_trkIsoNorm_trkIsoEWK.eps}
245 kkaadze 1.2 \vspace{-10mm}
246 ymaravin 1.4 \caption{Performance of three track isolation variables for barrel (left) and endcap (right).
247 kkaadze 1.2 \label{fig:TrkIso_no_vs_norm}}
248     \end{center}
249     \end{figure}
250    
251 ymaravin 1.4 The normalized track isolation variable, $Iso^{trk}_{Norm}$, offers the best performance,
252     with an expense of making the efficiency to depend on the electron candidate $p_T$.
253     As it is described in the Section~\ref{Results}, the requirement results in a loss of efficiency
254     at low $p_T$; however, the loss is small in the characteristic range of $p_T$ of electrons
255     from $W$ and $Z$ boson decays.
256    
257     By introducing the isolation requirement, we also reject a significant background from
258     non-isolated electrons from $Zb\bar{b}$ production, where $b$ quarks decay semileptonically.
259     The distribution of $Iso^{trk}_{Norm}$ for signal electrons and misidentified
260     light- and $b$-quark jets are given in Fig.~\ref{fig:TrackIsoChowd}
261    
262     \begin{figure}[!tb]
263     \begin{center}
264     \includegraphics[width=16cm]{Figs/TrackIsoChowd.eps}
265     \vspace{-10mm}
266     \caption{The distribution of the normalized track isolation defined in Eq.~\ref{eq:trkIsoN} for signal
267     electrons (blue solid line), misidentified light-quark jets (red squares), and $b$-quark jets (black inverted
268     triangles) in barrel (left) and endcap (right).
269     \label{fig:TrackIsoChowd}
270     }
271     \end{center}
272     \end{figure}
273    
274     We illustrate the performance of track isolation variable chosen for ``Simple Loose'' criteria
275     in Fig.~\ref{fig:trkIso}, and we set the requirement of $Iso^{trk}_{Norm}$ to be less than
276     0.1 (0.2) for barrel(endcap).
277     \begin{figure}[!tb]
278     \begin{center}
279 kkaadze 1.5 \includegraphics[width=16cm]{Figs/TrkIso.eps}
280 ymaravin 1.4 \vspace{-10mm}
281     \caption{The $Iso^{trk}_{Norm}$ discriminant for electrons (blue soild line) and misidentified jets (red dashed line) for barrel (top left) and endcap (top right). The signal $v.s.$ background efficiencies
282     obtained by varying the threshold are displayed in bottom left (barrel) and bottom right (endcap) plots.
283     \label{fig:trkIso}
284     }
285     \end{center}
286     \end{figure}
287    
288     %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
289 kkaadze 1.2
290 ymaravin 1.4 \subsection{Tuning the ``Simple Tight" criteria}
291     The ``Simple Loose'' criteria utilize variables that describe spatial- and energy-matching of a
292     track to a SC, lateral energy deposition profile of a SC, and a track isolation requirement.
293     Although, these requirements are sufficient to select a relatively clean sample of $Z\to ee$ events,
294     the discriminating power is insufficient to suppress the background from misidentified jets
295     from the $Z+jets$ processes. Therefore, we introduce a ``Simple Tight'' criteria that is based
296     on the ``Simple Loose'' one with an addition of the variable that takes into account the energy
297     deposition in the HCAL.
298    
299     The variable that is used by the official criteria is $H/E$. We also study $EmHad$ variable
300     defined in Eq.~\ref{eq:emhad} which is a mix of $H/E$ and an HCAL+ECAL isolation in a
301     relatively narrow cone of $\Delta R < 0.3$.~\footnote{Similarly to the track isolation variable,
302     we varied the cone size to select the most optimal variable.}
303     As expected, these two variables are found to be
304     correlated, although the $EmHad$ has much stronger discriminating power
305     (see Fig.~\ref{fig:Eff_hoE_IsoEmHad}, due to an additional requirement on the lateral energy
306     profile in the 0.3 cone.
307 kkaadze 1.2
308 ymaravin 1.4 \begin{figure}[!tb]
309     \begin{center}
310 kkaadze 1.5 \includegraphics[width=16cm]{Figs/Eff_HoE_IsoEmHad.eps}
311 ymaravin 1.4 \vspace{-10mm}
312     \caption{$H/E$ and $EmHad$ variables for barrel (left) and endcap (right).
313     \label{fig:Eff_hoE_IsoEmHad}}
314     \end{center}
315     \end{figure}
316 kkaadze 1.2
317 ymaravin 1.4 We chose to apply the $EmHad < 0.18 (0.1)$ requirement for barrel (endcap)
318     to the ``Simple Loose'' set (see Fig.~\ref{fig:EmHadIso}). The obtained selection
319     criteria is thus referred to as to ``Simple Tight''. Applying other ECAL or HCAL isolation
320     criteria on top of ``Simple Tight'' does not result in any substantial improvement
321     in performance. Thus, we believe that $EmHad$ and tracker isolation are sufficient
322     to identify the isolated electrons in the $p_T$ range of those from $W$ and $Z$ boson decays.
323 kkaadze 1.2
324 beaucero 1.3 \begin{figure}[!tb]
325 kkaadze 1.2 \begin{center}
326     \includegraphics[width=16cm]{Figs/EmHadIso.eps}
327     \vspace{-10mm}
328 ymaravin 1.4 \caption{$EmHad$ for electrons (solid blue line) and mis-identified jets (red dashed line)
329     in barrel (top left) and endcap (top right). The performance for the discriminator is given below
330     for barrel (bottom left) and endcap (bottom right).
331 kkaadze 1.2 \label{fig:EmHadIso}}
332     \end{center}
333     \end{figure}
334    
335    
336 ymaravin 1.4 \subsection{Summary of definitions of the ``Simple Loose'' and ``Simple Tight''}
337 kkaadze 1.2
338 ymaravin 1.4 To summarize the results, the ``Simple Loose'' criteria is a simple cut-based criteria
339     that utilize three discriminants from the official ``Robust'' ($\Delta\eta$, $\Delta\phi$, and
340     $\sigma_{\eta\eta}$), a modified variable from the ``Loose'' (a replacement of $E/p$ with
341     $E_{seed}/p_{out}$) and an additional track isolation requirement. The thresholds
342     have been also optimized to retain high signal efficiency.
343 kkaadze 1.2
344 ymaravin 1.4 The ``Simple Tight'' requirement is ``Simple Loose'' with an addition of an effective
345     $H/E$ and ECAL+HCAL isolation requirement.
346 kkaadze 1.2
347 ymaravin 1.4 We summarize the thresholds for barrel and endcap in Table~\ref{tab:OurSelection}
348     and proceed with detailed analysis of the performance in the next Section.
349 kkaadze 1.2
350 ymaravin 1.4 \begin{table}[htb]
351     \caption{Thresholds and efficinecy of the criteria.}
352     \label{tab:OurSelection}
353     \begin{center}
354     \begin{tabular}{|c|c|c|} \hline
355     & Barrel threshold & Endcap \\ \hline
356     $\Delta\eta < $ & 0.009 & 0.007 \\
357     $\Delta\phi < $ & 0.005 & 0.005 \\
358     $\sigma_{\eta\eta} < $ & 0.012 & 0.026 \\
359     $E_{seed}/p_{out} > $ & 0.9 & 0.9 \\
360     $Iso^{trk}_{Norm} < $ & 0.1 & 0.2 \\
361     $EmHad < $ & 0.18 & 0.1 \\ \hline
362     \end{tabular}
363     \end{center}
364     \end{table}
365 beaucero 1.1
366 kkaadze 1.5