ViewVC Help
View File | Revision Log | Show Annotations | Root Listing
root/cvsroot/UserCode/Vuko/Notes/ElectronID/Variables.tex
Revision: 1.4
Committed: Fri Jun 20 17:26:18 2008 UTC (16 years, 10 months ago) by ymaravin
Content type: application/x-tex
Branch: MAIN
Changes since 1.3: +211 -117 lines
Log Message:
YM: added Appendix for standard official ID, modified extensively the text, changed the names of eps files to make them compatible with macosx

File Contents

# User Rev Content
1 beaucero 1.1 Electron is identified by combining information from the CMS
2     tracker and ECAL. Our initial (preselection) requirement is that
3     the electron candidate in addition to the kinematics criteria
4     described in the Section above, is also identified as a GSF electron,
5     {\it i.e.}, it has a GSF track matched with the ECAL Super Cluster (SC).
6     The efficiency of the selection criteria is measured with respect
7     to these initial requirements.
8    
9     The focus of this note is to optimize official loose and tight identification
10 ymaravin 1.4 criteria defined in Appendix~\ref{a:OfficialCriteria}
11     to identify electrons from $Z$ and $W$ decays, respectively.
12 beaucero 1.1 This can be achieved by optimizing the thresholds, optimizing the
13     discriminants that have more background rejection power, and selecting
14     the variables that are not highly correlated with each other.
15     The latter allows keeping the number of variables in the criteria to a
16     minimum, which results in a simpler set of requirements with smaller
17     systematic uncertainties, and thus more robust in the startup conditions.
18    
19     Variables that allow discriminating electrons from em-jets can be roughly
20     divided into two classes: matching/shower-shape and isolation
21     discriminants. In the following we treat these two classes separately to
22     follow the existing egamma POG identification scheme. The variables are
23     described in the next two subsections.
24    
25     \subsection{Identification variables}
26     \label{ss:matching}
27     \subsubsection{Track-ECAL matching}
28     An electron GSF track should be well-matched to the ECAL Super Cluster (SC), while
29     a $\pi^0$ energy deposit in ECAL is not necessarily matched well with a GSF track, which
30 ymaravin 1.4 is usually produced by a charged pion. Thus, a spatial match between a track and a SC can be a good
31 beaucero 1.1 discriminant. This match is described in azimuthal and pseudorapidity planes and denoted as
32 kkaadze 1.2 $\Delta\phi$ and $\Delta\eta$, respectively.
33 beaucero 1.1
34     \subsubsection{ECAL energy width}
35 ymaravin 1.4 \label{ss:etawidth}
36 beaucero 1.1 An electron brems extensively in CMS tracker, which results in a rather wide shower in azimuthal
37     plane due to a strong CMS magnetic field. However, the width of a shower in pseudorapidity
38 kkaadze 1.2 plane remains very narrow and can discriminate against jets, which tend to create rather large
39     clusters in both $\eta$ and $\phi$ directions. We consider two parameterization of the shower width:
40     the one, used in official electron identification developed by egamma POG, $\sigma_{\eta\eta} = \sqrt{CovEtaEta}$ which describes the width of the highest-energy basic cluster of the SC, further referred
41     to as a seed cluster, and an energy-weighted width of SC in $\eta$ direction, defined as
42 beaucero 1.1 \begin{equation}
43     \label{eq:etawidth}
44     \sigma_\eta = \frac{1}{E_{SC}} \sqrt{\sum_{ECAL~SC~RecHits} E_i(\eta_i - \eta_{SC})^2}.
45     \end{equation}
46    
47 ymaravin 1.4 \subsubsection{E/p-based variables}
48     \label{ss:ep}
49     Electron should deposit all of its energy in the ECAL detector, thus the track momentum
50     at the outer edge of tracker $p_{out}$ should be of the same order as an energy of the seed EM
51     cluster $E_{seed}$. Jet is reconstructed combining information from CMS tracker and hadronic calomitere (HCAL). It interact with ECAL much less intensively and most of its energy, carried after tracker, is deposited in HCAL. Hence, $p_{out}$ and $E_{seed}$ differ significantly for the jets and can be used to discriminate them from electrons.
52     We also considered an official version of this discriminant $E_{SC}/p_{in}$, where $E_{SC}$ is a SC energy, and $p_{in}$ is the initial momentum of a charged particle.
53    
54 beaucero 1.1 \subsubsection{H/E variables}
55     One can form a powerful discriminant by using the HCAL and ECAL energies associated
56     with an electron candidate, as electrons tend to deposit very little or no energy in HCAL,
57 ymaravin 1.4 while jets produce a wide energy deposition in the HCAL. A variable used in official
58 kkaadze 1.2 ``Robust" as well as ``Loose" and ``Tight" criteria is the ratio of HCAL and ECAL energies: $H/E$.
59     It peaks around 0 for electrons and can be quite large for em-jets. We form a variable that also
60 beaucero 1.3 takes into account the width of the ECAL energy deposition by making a ratio of the energy deposited
61 beaucero 1.1 in HCAL and ECAL in the cone of $\Delta R < 0.3$ which is not included in the SC, normalized
62     to the SC energy as follows
63    
64     \begin{equation}
65     \label{eq:emhad}
66 ymaravin 1.4 EmHad =\frac{1}{E_{SC}}\left(\sum_{\Delta R=0.3}E^{ecal}_{RecHit} +
67 beaucero 1.1 \sum_{\Delta R=0.3}E^{hcal}_{RecHit} - E_{SC}\right).
68     \end{equation}
69    
70     \subsubsection{Track isolation requirements}
71 ymaravin 1.4 \label{ss:trackIso}
72 beaucero 1.1 As electrons from $W$ and $Z$ boson decays are isolated, requiring electron isolated from tracking
73     activity can significantly suppress em-jets that usually have a large number of soft tracks around the
74     leading $\pi^0$. It also can suppress real electrons from semi-leptonic decays of $b$ quarks which
75     tend to be non-isolated as well. We consider several versions of the track isolation requirements:
76    
77     \begin{equation}
78     \label{eq:trkIsoN}
79 ymaravin 1.4 Iso^{trk}_{Norm.} = \frac{1}{p_T(e)}\left(\sum_{\Delta R=0.3} p_T(trk) - \sum_{\Delta R=0.05} p_T(trk)\right),
80 beaucero 1.1 \end{equation}
81    
82     and non-normalized version of the above:
83    
84     \begin{equation}
85     \label{eq:trkIso}
86 ymaravin 1.4 Iso^{trk} = \sum_{\Delta R=0.3} p_T(trk) - \sum_{\Delta R=0.05} p_T(trk).
87     \end{equation}
88    
89     We also test the track isolation discriminant with the larger outer cone, similar to that defined by electroweak group:~\footnote{Our analysis trees do not store information that would allow us to test
90     the isolation variable with an annulus of $0.02 < \Delta R < 0.6$. However, the performance
91     of a track isolation in an annulus of $0.05 < \Delta R < 0.5$ is similar.}
92     \begin{equation}
93     \label{eq:trkIsoEWK}
94     Iso^{trk}_{EWK} = \sum_{\Delta R = 0.6} p_T(trk) - \sum_{\Delta R = 0.02} p_T(trk).
95 beaucero 1.1 \end{equation}
96    
97 ymaravin 1.4
98 beaucero 1.1
99    
100     \subsection{Optimization method and strategy}
101 ymaravin 1.4 The focus of this work is to define two simple selection criteria: ``Simple Loose" and ``Simple Tight"
102     to select electrons from $Z \to ee$ and $W \to e\nu_e$ processes, respectively. We start building
103     the ``Simple Loose" electron criteria based on the discriminants utilized in the official ``Robust''
104     and adding more variables from the ``Loose" criteria (or their more powerful equivalents described
105     above), and tuning the thresholds to keep the efficiency of each criterion high. We change the order
106     of how the requirements are applied to assess the level of correlation between different criteria.
107     We try to retain only the variables that are as less correlated as possible to keep the number
108     of criteria to a minimum.
109    
110     We design the ``Simple Tight'' requirements by applying additional criteria on top of the
111     unchanged ``Simple Loose''. We chose not to modify thresholds of ``Simple Loose'' criteria
112     to reduce possible systematic uncertainties and to make the process of extracting the
113     efficiency from data easier.
114    
115     Both ``Simple Tight'' and ``Simple Loose'' criteria are designed separately for ECAL
116     barrel and endcap. We consider electron candidates with $p_T > 10$ GeV. The signal
117     electron sample was obtained from the $WZ$ Monte Carlo simulation, and the misidentified
118     jets are obtained from the "Gumbo" soup where we exclude $\gamma+X$ processes.
119    
120     \subsection{Tuning the ``Simple Loose" criteria}
121     The first two criteria of ``Robust'' requirements are $\Delta \eta$ and $\Delta \phi$.
122     The distributions of these variables for signal and background together with the
123     em-jet separation power are given in Figs.~\ref{fig:DeltaEta}~and~\ref{fig:DeltaPhi}.
124     We apply both requirements with the thresholds of 0.009(0.007) and 0.05(0.05) for
125     $\Delta \eta$ and $\Delta \phi$ for barrel(endcap), respectively.
126 kkaadze 1.2
127 beaucero 1.3 \begin{figure}[!tb]
128 kkaadze 1.2 \begin{center}
129     \includegraphics[width=16cm]{Figs/deltaEta.eps}
130     \vspace{-10mm}
131 ymaravin 1.4 \caption{Distributions of $\Delta \eta$ for electrons (blue solid line) and misidentified jets
132     (red dashed line) in barrel (top left) and endcap (top right).
133     The performance of these discriminants in terms of efficiencies to select
134     true electrons and misidentified jets are given below.
135 kkaadze 1.2 \label{fig:DeltaEta}}
136     \end{center}
137     \end{figure}
138    
139    
140 beaucero 1.3 \begin{figure}[!tb]
141 kkaadze 1.2 \begin{center}
142     \includegraphics[width=16cm]{Figs/deltaPhi.eps}
143     \vspace{-10mm}
144 ymaravin 1.4 \caption{The same as Fig.~\ref{fig:DeltaEta}, but for $\Delta \phi$.
145 kkaadze 1.2 \label{fig:DeltaPhi}}
146     \end{center}
147     \end{figure}
148    
149 ymaravin 1.4 The third discriminating variable included in the ``Robust'' criteria describes the width
150     of the EM shower. We study two different parameterizations of this variable
151     described in~\ref{ss:etawidth}. The performance of their discriminating power is given in
152     Fig.~\ref{fig:SigmaEE_vs_EtaWidth}.
153 kkaadze 1.2
154 beaucero 1.3 \begin{figure}[!tb]
155 kkaadze 1.2 \begin{center}
156     \includegraphics[width=16cm]{Figs/Eff_EtaWidth_SigmaEtaEta.eps}
157     \vspace{-10mm}
158 ymaravin 1.4 \caption{The background efficiency $v.s.$ signal efficiency in barrel (left) and endcap (right)
159     for $\eta$ width of SC (red) and $\sigma_{\eta\eta}$ (blue).
160 kkaadze 1.2 \label{fig:SigmaEE_vs_EtaWidth}}
161     \end{center}
162     \end{figure}
163    
164 ymaravin 1.4 As it can be seen, the $\sigma_{\eta\eta}$ variable is more powerful in barrel that can reject $\sim 70\%$
165     of em-jets while loosing only a couple of percents of signal. The performance in endcap is comparable
166     between the two. As $\sigma_{\eta\eta}$ is shown to be described very well by Monte
167     Carlo~\footnote{$\sigma_{\eta\eta}$ was studied using test beam data and was shown to be described
168     in simulation very well.} we chose to select $\sigma_{\eta\eta} < 0.012 (0.026)$ for barrel (endcap).
169     The plot with distributions of $\sigma_{\eta\eta}$ and the performance in barrel and endcap is given
170     in Fig.~\ref{fig:SigmaEtaEta}.
171 beaucero 1.3 \begin{figure}[!tb]
172 kkaadze 1.2 \begin{center}
173 ymaravin 1.4 \includegraphics[width=16cm]{Figs/sigmaEtaEta.eps}
174 kkaadze 1.2 \vspace{-10mm}
175 ymaravin 1.4 \caption{
176     Distributions of $\sigma_{\eta\eta}$ for electrons (blue solid line) and misidentified jets
177     (red dashed line) in barrel (top left) and endcap (top right).
178     The performance of these discriminants in terms of efficiencies to select
179     true electrons and misidentified jets are given below.
180 kkaadze 1.2 \label{fig:SigmaEtaEta}}
181     \end{center}
182     \end{figure}
183    
184 ymaravin 1.4 The discriminants that we chose so far are used in official ``Robust'' criteria.
185     The last requirement from the official criteria is $H/E$ which we chose to keep
186     unchanged from the predefined GSF electron requirement ($H/E < 0.2$).
187     It will be shown that $H/E$ variable is highly correlated with isolation requirements
188     that we will apply later, and thus it does not need to be tightened.
189    
190     As all the discriminants from the ``Robust'' selection are considered, we start including
191     the variables utilized in the official ``Loose'' requirement. At first, we consider
192     $E_{seed}/p_{out}$ and $E/p$, described in Section~\ref{ss:ep}.
193     The latter is used in the official ``Loose'' and ``Tight'' selections to categorize
194     electrons in $E/p$ $v.s.$ $fbrem$ phase space, where $fbrem$ is defined as
195     \begin{equation}
196     \label{eq:fbrem}
197     fbrem = \frac{p_{in} - p_{out}}{p_{in}},
198     \end{equation}
199     where $p_{in}$ and $p_{out}$ are momenta measured at the inner and outer edge of
200     the tracker, respectively. The performance of the $E_{seed}/p_{out}$ and $E/p$
201     are given in Fig.~\ref{fig:EseedOPout_vs_EoP}.
202 kkaadze 1.2
203 beaucero 1.3 \begin{figure}[!tb]
204 kkaadze 1.2 \begin{center}
205     \includegraphics[width=16cm]{Figs/Eff_EoP_EseedoPout.eps}
206     \vspace{-10mm}
207 ymaravin 1.4 \caption{Background $v.s.$ signal efficiency for barrel (left) and endcap (right) for $E/p$ (red)
208     and $E_{seed}/p_{out}$ (blue).
209 kkaadze 1.2 \label{fig:EseedOPout_vs_EoP}}
210     \end{center}
211     \end{figure}
212    
213 ymaravin 1.4 $E_{seed}/p_{out}$ has a better performance and we include it to the ``Simple Loose''
214     requirement to be more than 0.9 for both barrel and endcap. The distributions of this
215     variable and discriminating performance is given in Fig.~\ref{fig:EseedOPout}.
216 kkaadze 1.2
217 beaucero 1.3 \begin{figure}[!tb]
218 kkaadze 1.2 \begin{center}
219     \includegraphics[width=16cm]{Figs/EseedPout.eps}
220     \vspace{-10mm}
221 ymaravin 1.4 \caption{$E_{seed}/p_{out}$ for electrons(blue solid line) and mis-identified jets (red dashed line)
222     in barrel (left top) and endcap (right top). The performance of the criteria for different thresholds
223     is given in the bottom left plot for barrel, and bottom right plot for endcap.
224 kkaadze 1.2 \label{fig:EseedOPout}}
225     \end{center}
226     \end{figure}
227    
228 ymaravin 1.4 Up to this point we utilized all of the variables of the ``Robust'' and ``Loose'' criteria with an exception
229     of a much looser $H/E$ requirement and substitution of $E/p$ with $E_{seed}/p_{out}$ variable.
230     As it is shown later in the note, the performance of this criteria with respect to the official ``Loose'' is
231     very comparable without complexity due to the categorization employed by the ``Loose'' criteria.
232     The improvement is mostly achieved by utilizing a more discriminating variables and tuned
233     thresholds.
234    
235     It is possible to further improve the performance of the ``Simple Loose'' identification by applying
236     track isolation requirements. We study the performance of three various track isolation requirements
237     described in Section~\ref{ss:trackIso} where we varied the cone sizes from 0.5 to 0.05 and select
238     the most optimal values. The results for the variables with optimized cone sizes are given
239     in Fig.~\ref{fig:TrkIso_no_vs_norm}.
240 kkaadze 1.2
241 beaucero 1.3 \begin{figure}[!tb]
242 kkaadze 1.2 \begin{center}
243 ymaravin 1.4 \includegraphics[width=16cm]{Figs/Eff_trkISoNonNorm_trkIsoNorm_trkIsoEWK.eps}
244 kkaadze 1.2 \vspace{-10mm}
245 ymaravin 1.4 \caption{Performance of three track isolation variables for barrel (left) and endcap (right).
246 kkaadze 1.2 \label{fig:TrkIso_no_vs_norm}}
247     \end{center}
248     \end{figure}
249    
250 ymaravin 1.4 The normalized track isolation variable, $Iso^{trk}_{Norm}$, offers the best performance,
251     with an expense of making the efficiency to depend on the electron candidate $p_T$.
252     As it is described in the Section~\ref{Results}, the requirement results in a loss of efficiency
253     at low $p_T$; however, the loss is small in the characteristic range of $p_T$ of electrons
254     from $W$ and $Z$ boson decays.
255    
256     By introducing the isolation requirement, we also reject a significant background from
257     non-isolated electrons from $Zb\bar{b}$ production, where $b$ quarks decay semileptonically.
258     The distribution of $Iso^{trk}_{Norm}$ for signal electrons and misidentified
259     light- and $b$-quark jets are given in Fig.~\ref{fig:TrackIsoChowd}
260    
261     \begin{figure}[!tb]
262     \begin{center}
263     \includegraphics[width=16cm]{Figs/TrackIsoChowd.eps}
264     \vspace{-10mm}
265     \caption{The distribution of the normalized track isolation defined in Eq.~\ref{eq:trkIsoN} for signal
266     electrons (blue solid line), misidentified light-quark jets (red squares), and $b$-quark jets (black inverted
267     triangles) in barrel (left) and endcap (right).
268     \label{fig:TrackIsoChowd}
269     }
270     \end{center}
271     \end{figure}
272    
273     We illustrate the performance of track isolation variable chosen for ``Simple Loose'' criteria
274     in Fig.~\ref{fig:trkIso}, and we set the requirement of $Iso^{trk}_{Norm}$ to be less than
275     0.1 (0.2) for barrel(endcap).
276     \begin{figure}[!tb]
277     \begin{center}
278     \includegraphics[width=16cm]{Figs/trkIso.eps}
279     \vspace{-10mm}
280     \caption{The $Iso^{trk}_{Norm}$ discriminant for electrons (blue soild line) and misidentified jets (red dashed line) for barrel (top left) and endcap (top right). The signal $v.s.$ background efficiencies
281     obtained by varying the threshold are displayed in bottom left (barrel) and bottom right (endcap) plots.
282     \label{fig:trkIso}
283     }
284     \end{center}
285     \end{figure}
286    
287     %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
288 kkaadze 1.2
289 ymaravin 1.4 \subsection{Tuning the ``Simple Tight" criteria}
290     The ``Simple Loose'' criteria utilize variables that describe spatial- and energy-matching of a
291     track to a SC, lateral energy deposition profile of a SC, and a track isolation requirement.
292     Although, these requirements are sufficient to select a relatively clean sample of $Z\to ee$ events,
293     the discriminating power is insufficient to suppress the background from misidentified jets
294     from the $Z+jets$ processes. Therefore, we introduce a ``Simple Tight'' criteria that is based
295     on the ``Simple Loose'' one with an addition of the variable that takes into account the energy
296     deposition in the HCAL.
297    
298     The variable that is used by the official criteria is $H/E$. We also study $EmHad$ variable
299     defined in Eq.~\ref{eq:emhad} which is a mix of $H/E$ and an HCAL+ECAL isolation in a
300     relatively narrow cone of $\Delta R < 0.3$.~\footnote{Similarly to the track isolation variable,
301     we varied the cone size to select the most optimal variable.}
302     As expected, these two variables are found to be
303     correlated, although the $EmHad$ has much stronger discriminating power
304     (see Fig.~\ref{fig:Eff_hoE_IsoEmHad}, due to an additional requirement on the lateral energy
305     profile in the 0.3 cone.
306 kkaadze 1.2
307 ymaravin 1.4 \begin{figure}[!tb]
308     \begin{center}
309     \includegraphics[width=16cm]{Figs/Eff_hoE_IsoEmHad.eps}
310     \vspace{-10mm}
311     \caption{$H/E$ and $EmHad$ variables for barrel (left) and endcap (right).
312     \label{fig:Eff_hoE_IsoEmHad}}
313     \end{center}
314     \end{figure}
315 kkaadze 1.2
316 ymaravin 1.4 We chose to apply the $EmHad < 0.18 (0.1)$ requirement for barrel (endcap)
317     to the ``Simple Loose'' set (see Fig.~\ref{fig:EmHadIso}). The obtained selection
318     criteria is thus referred to as to ``Simple Tight''. Applying other ECAL or HCAL isolation
319     criteria on top of ``Simple Tight'' does not result in any substantial improvement
320     in performance. Thus, we believe that $EmHad$ and tracker isolation are sufficient
321     to identify the isolated electrons in the $p_T$ range of those from $W$ and $Z$ boson decays.
322 kkaadze 1.2
323 beaucero 1.3 \begin{figure}[!tb]
324 kkaadze 1.2 \begin{center}
325     \includegraphics[width=16cm]{Figs/EmHadIso.eps}
326     \vspace{-10mm}
327 ymaravin 1.4 \caption{$EmHad$ for electrons (solid blue line) and mis-identified jets (red dashed line)
328     in barrel (top left) and endcap (top right). The performance for the discriminator is given below
329     for barrel (bottom left) and endcap (bottom right).
330 kkaadze 1.2 \label{fig:EmHadIso}}
331     \end{center}
332     \end{figure}
333    
334    
335 ymaravin 1.4 \subsection{Summary of definitions of the ``Simple Loose'' and ``Simple Tight''}
336 kkaadze 1.2
337 ymaravin 1.4 To summarize the results, the ``Simple Loose'' criteria is a simple cut-based criteria
338     that utilize three discriminants from the official ``Robust'' ($\Delta\eta$, $\Delta\phi$, and
339     $\sigma_{\eta\eta}$), a modified variable from the ``Loose'' (a replacement of $E/p$ with
340     $E_{seed}/p_{out}$) and an additional track isolation requirement. The thresholds
341     have been also optimized to retain high signal efficiency.
342 kkaadze 1.2
343 ymaravin 1.4 The ``Simple Tight'' requirement is ``Simple Loose'' with an addition of an effective
344     $H/E$ and ECAL+HCAL isolation requirement.
345 kkaadze 1.2
346 ymaravin 1.4 We summarize the thresholds for barrel and endcap in Table~\ref{tab:OurSelection}
347     and proceed with detailed analysis of the performance in the next Section.
348 kkaadze 1.2
349    
350 ymaravin 1.4 \begin{table}[htb]
351     \caption{Thresholds and efficinecy of the criteria.}
352     \label{tab:OurSelection}
353     \begin{center}
354     \begin{tabular}{|c|c|c|} \hline
355     & Barrel threshold & Endcap \\ \hline
356     $\Delta\eta < $ & 0.009 & 0.007 \\
357     $\Delta\phi < $ & 0.005 & 0.005 \\
358     $\sigma_{\eta\eta} < $ & 0.012 & 0.026 \\
359     $E_{seed}/p_{out} > $ & 0.9 & 0.9 \\
360     $Iso^{trk}_{Norm} < $ & 0.1 & 0.2 \\
361     $EmHad < $ & 0.18 & 0.1 \\ \hline
362     \end{tabular}
363     \end{center}
364     \end{table}
365 beaucero 1.1