ViewVC Help
View File | Revision Log | Show Annotations | Root Listing
root/cvsroot/UserCode/Vuko/Notes/WZCSA07/zjetbackground.tex
(Generate patch)

Comparing UserCode/Vuko/Notes/WZCSA07/zjetbackground.tex (file contents):
Revision 1.5 by vuko, Sat Jun 21 15:32:35 2008 UTC vs.
Revision 1.13 by beaucero, Fri Jun 27 15:21:57 2008 UTC

# Line 1 | Line 1
1   \section{Signal extraction}
2 < %\label{sec:gen}
3 < The probability to misidentify a jet as a muon is very low while for
4 < the case of the electron, $\pi^0$ in jets can be misidentified as
5 < electrons. When considering the subtraction of the background in this
6 < analysis, we will mainly concentrate on the final state where the $W$
7 < is decaying to an electron. For the muon case, the subtraction can be
8 < done using Monte Carlo sample and assigning a large error on this
9 < estimation.
10 <
11 < \subsection{Z+jets background fraction}
12 < The main background remaining even after having applied all our selection
13 < in the case of the $W$ decaying to electron is the $Z+jets$
14 < production. As signal and background have a \Z boson in the final
15 < state, we will concentrate on the third lepton which is an electron in
16 < this study.
17 <
18 < In order to select the \Z candidate in the events, we apply loose
19 < criteria. The loose sample contain a given number of signal events
20 < which contains a third isolated electron and a given number of
21 < background events which do not contains a third isolated
22 < electron. When we apply the tight criteria the fraction of signal and
23 < background events is changing according to the efficiency of the
24 < criteria. This can be expressed by this formula:
25 < \begin{eqnarray}
26 < N_{loose} & = & \hspace*{0.9cm}               N_e +   \hspace*{0.9cm}   N_{j} \\
27 < N_{tight} & = & \epsilon_{tight} N_e  + p_{fake}  N_{j}
28 < \end{eqnarray}
29 < Where $N_{loose}$ and $N_{tight}$ are the numbers of events in
30 < the loose and tight samples, respectively, $N_e$ is the number of events with a third
31 < isolated electron, $N_j$ is the number of events without a third
32 < isolated electron, $\epsilon_{tight}$ is the efficiency of the tight
33 < criteria on electron, $p_{fake}$ is the probability for a jet identified
34 < as a loose electron to be also identified as a tight electron.  By
35 < solving this set of equations we obtain:
36 < $$
37 < N_e     = \frac{N_{tight}-p_{fake} N_{loose}} { \epsilon_{tight} -p_{fake}} \ \ \ \mbox{and} \ \ \
38 < N_{j} = \frac{ \epsilon_{track} N_{loose} - N_e}{  \epsilon_{tight} -p_{fake}}
39 < $$
40 <
41 < The estimation of $\epsilon_{tight}$ and $p_{fake}$ can be done using control
42 < samples, containing electrons and jet respectively with high purity. The Tag and probe
43 < method is used to determine signal efficiency $\epsilon_{tight}$. The rate
44 < will be derived from $Z \to e^+e^-$ samples. In the following section we
45 < describe the method used to determine $p_{fake}$.
46 <
47 < \subsection{Determination of $p_{fake}$}
2 > \label{sec:SignalExt}
3 > We separate backgrounds into two categories: one with a genuine \Z boson
4 > from $\Z+jets$ processes, and the other without a genuine \Z boson from
5 > $t\bar{t}$ and $\W+jet$ production. The latter source can be estimated from
6 > the invariant mass of the \Z boson candidate, where the background events
7 > with no genuine \Z boson should not produce a \Z mass peak and should
8 > be relatively smooth.
9 >
10 > \subsection{Study of the background without a genuine \Z boson}
11 > We estimate the background due to events without a genuine \Z boson from
12 > fitting a \Z candidate invariant mass to a Gaussian function convoluted
13 > with a Breit-Wigner function. The background is parameterized as a straight
14 > line. An example of a fit for $3e$ category is given in Fig.~\ref{fig:ZFit}.
15 > Both number of signal and background events are calculated for the
16 > invariant mass range between 81 and 101 GeV. In Table~\ref{tab:FitVsMC}
17 > we summarize the number of background events obtained from the fit
18 > and from the Monte Carlo truth information.
19 >
20 > \begin{figure}[!bp]
21 >  \begin{center}
22 >  \scalebox{0.4}{\includegraphics{figs/FitBkg3eTight.eps}}
23 >  \caption{The invariant mass distribution of the $Z$ boson candidate that is fit to a signal
24 >  parameterized as a Gaussian function convoluted with a Breit-Wigner function and
25 >  a background, parameterized as a straight line.}
26 >  \label{fig:ZFit}
27 >  \end{center}
28 > \end{figure}
29 >
30 > \begin{table}[!tb]
31 > \begin{center}
32 > \begin{tabular}{|l|c|c|c|c|c|c|c|} \hline
33 >                    & \multicolumn{2}{c|}{Background with genuine \Z} & \multicolumn{4}{c|}{Background without
34 >                    genuine \Z boson} \\
35 > Channel    & $\Z+jets$ & $\Z b\bar{b}$ &   $t\bar{t}$ & $\W+jets$ & $t\bar{t}$ + $\W+jets$ & Fit result \\ \hline
36 > $3e$ Loose & 152.0 & 43.1 & 6.1 & 0 & 6.1& 3.8 $\pm$ 1.1 \\ \hline
37 > $3e$ Tight & 44.9 & 17.1 & 2.7 & 0 & 2.7 & 2.8 $\pm$ 1.0 \\ \hline  
38 > $2\mu 1e$ Loose & 189.6 & 52.6 & 4.7 & 0 & 4.7 & 3.5 $\pm$ 1.1\\ \hline
39 > $2\mu 1e$ Tight & 63.1 & 19.0 & 1.4 & 0 & 1.4 & 1.9 $\pm$ 0.8\\ \hline
40 > $2e1\mu$   & 9.2 & 8.2 & 2.5 & 1.5 & 4.0 & 0.7 $\pm$ 0.5\\ \hline
41 > $3\mu$     & 1.9 & 7.8 & 0.7 & 0 & 0.7 & 0.6 $\pm$ 0.5\\ \hline
42 > \end{tabular}
43 > \end{center}
44 > \caption{Comparison between Monte Carlo truth information and the results of the fit for the background without genuine \Z boson. Number of events are obtained in the invariant mass range between 81 and 101 GeV. The ``Loose'' and ``Tight'' selection criteria applied for $W\rightarrow e\nu$ final state only.
45 > %I AM NOT SURE I UNDERSTAND WHAT IS WRITTEN HERE
46 > % One has to consider that this study as been perform on a smaller sample than the other part of the analysis a 10\% statistics error as to be counted until the study is performed on the whole samples.
47 > }
48 > \label{tab:FitVsMC}
49 > \end{table}
50 >
51 >
52 > \subsection{Estimation of the background with genuine \Z boson}
53 > \label{sec:D0Matrix}
54 > All of the instrumental background events with real \Z boson come from
55 > $\Z + jets$ processes where one of the jets is misidentified as a
56 > lepton.  The same principle will be used for electron and muon while
57 > the background is coming from differente sources.
58 > The probability to misidentify a jet as a muon is very low in CMS, while that
59 > for the case of electron can be quite high as jets with large electromagnetic
60 > energy fraction can be misidentified as electrons. $\Z+jet$ background
61 > is especially high for \WZ\ signal with $\W\to e\nu$. Thus, it is imperative
62 > to have a reliable estimation of this background from data to avoid
63 > unnecessary systematic uncertainties due to Monte Carlo description of data
64 > in startup conditions. Therefore, in the following we describe the data-driven
65 > estimation of the $\Z+jets$ background for the $\ell^+\ell^- e$
66 > categories. A similar study for the remaining $\ell^+\ell^- \mu$ categories
67 > are in progress. However, as the $\Z+jets$ background is sufficiently small, it is
68 > possible to use Monte Carlo simulation to estimate $\Z+jet$ background with
69 > early data, without incurring significant systematic uncertainty due to data modeling.
70 >
71 > \subsubsection{$\Z+jets$ background fraction}
72 > To estimate the fraction of the $\Z+jets$ events in data
73 > we apply a method, commonly referred to as ``matrix'' method.
74 > The idea of a method is to apply ``Loose'' identification criteria
75 > on the third lepton after \Z boson candidate is identified
76 > and count the number of the observed events, $N_{loose}$.
77 > These events contain events with real electrons $N_{e}$
78 > and events with misidentified jets $N_j$:
79 > \begin{equation}
80 > \label{eq:matrixEq1}
81 > N_{loose} = N_e + N_j.
82 > \end{equation}
83 >
84 > If we are to apply ``Tight'' selection on the third lepton, the number
85 > of the observed events $N_{tight}$ would change as following
86 > \begin{equation}
87 > \label{eq:matrixEq2}
88 > N_{tight} = \epsilon_{tight} N_e + p_{fake} N_j,
89 > \end{equation}
90 > where $\epsilon_{tight}$ and $p_{fake}$ are efficiency of ``Tight''
91 > criteria with respect to ``Loose'' requirements for electrons and
92 > misidentified jets, respectively. As $N_{loose}$ and $N_{tight}$
93 > are directly observable, to extract the number of $Z+jet$ events
94 > in the final sample, one needs to measure $\epsilon_{tight}$
95 > and $p_{fake}$ in control data samples. Two possible ways
96 > to estimate these values are given below.
97 >
98 > \subsubsection{Determination of $\epsilon_{tight}$}
99 >
100 > % UNDEERSTAND THIS FIT: THIS CANNOT BE SHOWN AS IT IS!!!
101 > %\begin{figure}[bt]
102 > %  \begin{center}
103 > %  \scalebox{0.8}{\includegraphics{figs/tag_probe_fit.eps}}
104 > %  \caption{Invariant mass of the \Z boson candidate for ``Tight-Tight'' (a)
105 > %  and ``Tight-Loose'' (b) electron selections fitted to a Gaussian with
106 > %  bifurcated Breit-Wigner functions.}
107 > %  \label{fig:tagprobe}
108 > %  \end{center}
109 > %\end{figure}
110 >
111 > To estimate the $\epsilon_{tight}$ we apply ``tag-and-probe'' method
112 > using $\Z \to e^+e^-$ from \Z+jets Chowder sample, including  \W+jets
113 > and $t\bar{t}$ as background. \Z mass distribution is separated for two cases where
114 > electrons from \Z decay either both pass ``Tigh'' selection (``Tight-Tight'' case), or only
115 > one passes the ``Tight'' selection, while the other electron passes ``Loose'' but not ``Tight''
116 > selection (``Tight-Loose'' case). To estimate signal in the selected \Z candidate invariant mass distribution, we fit it to a Gaussisan with bifurcated Breit-Wigner function as a signal
117 > and straight line for a background model. \Z mass distribution and fit are shown in \ref{fig:tagprobe}.
118 >
119 > Equation for determination of signal efficiency is given as
120 > \begin{equation}
121 > \epsilon_{tight}=\frac{ 2*(N_{TT}-B_{TT}) }{ (N_{TL}-B_{TL})+2*(N_{TT}-B_{TT}) }
122 > \end{equation}
123 >
124 > where $N_{TT}$,$B_{TT}$,$N_{TL}$ and $B_{TL}$ are, respectively, number of signal+background
125 > and background events for ``Tight-Tight'' and ``Loose-Tight'' electron combinations.
126 > %We estimated an efficiency $\epsilon_{tight}=0.99 \pm  0.01$.
127 >
128 > \subsubsection{Determination of $p_{fake}$}
129  
130   As the events will be most of the time triggered by the leptons coming
131 < from \Z boson, we assume that the third lepton is unbiased toward
132 < trigger requirement. Ideally we need a sample of pure multi-jet events
131 > from \Z boson, we assume that the third lepton is unbiased toward the
132 > trigger requirement. Ideally, we need a sample of pure multi-jet events
133   in order to compute the probability for a jet identified as a loose
134   electron to be also identified as a tight electron. In selecting such
135   a sample in data, one has to avoid any bias from the trigger
136 < requirements on the loose electron candidate.
136 > requirements on the ``Loose'' electron candidate.
137   %Such sample will not
138   %exist in data as they will be bias by the trigger requirement.
139 + \begin{figure}[bt]
140 +  \begin{center}
141 +  \scalebox{0.6}{\includegraphics{figs/tight_eff_gumbo.eps}}
142 +  \caption{Fraction of electron candidates passing the ``Tight'' criteria
143 +    in multijet event. No trigger requirement has been applied.{\em NEW PLOT
144 +    WITH TRIGGER REQUIREMENTS TO COME}}
145 +  \label{fig:qcd_efftight_noHLT}
146 +  \end{center}
147 + \end{figure}
148 +
149 + From a sample of multijet events triggered by an ``OR'' of multi-jet
150 + triggers, we select a ``Loose'' electron candidate that are not
151 + matched to any of the trigger objects. We also require the
152 + electron candidate to be separated from the jet that satisfies
153 + the trigger requirement by requiring the candidate to be separated
154 + by at least $\Delta R = 0.2$ from the trigger object.
155 + This allows us to obtain an unbiased sample of multijet events
156 + where an electron candidate is likely to be either a converted
157 + photon or a misidentified jet. The $p_{fake}$ function of $p_T$
158 + and $\eta$ is simply obtained by dividing the $p_T$ and $\eta$
159 + distributions for the electron candidate that satisfied ``Simple Tight''
160 + electron identification requirements to that for electron candidates
161 + that satisfied ``Simple Loose''. Such distributions are given
162 + in Fig.~\ref{fig:qcd_zjet_est}.
163  
59 From a sample of multi-jet events triggered by an ``OR'' of multi-jet
60 triggers, we will select loose electron candidates that are not
61 matched to any of the triggering object
62 %we will reject the object matched with the triggering
63 %objects. This will allow us to have a unbiased sample of multi-jet
64 %events. For the purpose of the study, we have used CSA07 Gumbo
65 %samples, with Pythia ID filtering in order to keep only events from
66 %photon+jets, QCD and minimum bias events.
67 The removal of the object matched with the triggering object is done
68 using a matching cone of $\Delta R =0.2$. "Simple Loose" selection is
69 applied to each reconstructed electron from this sample of jets from
70 QCD and photon+jet. Then the tight criteria is applied on such loose
71 electrons and the $p_{fake}$ is simply the ratio of this two
72 population. This ratio, given as a function of $Pt$ and $\eta$, is
73 showed in plots \ref{fig:qcd_zjet_est}.
164  
75 \subsection{Determination of $\epsilon_{tight}$}

Diff Legend

Removed lines
+ Added lines
< Changed lines
> Changed lines