[ViewVC] Diff of: cvsroot/UserCode/Vuko/Notes/WZCSA07/zjetbackground.tex

Comparing UserCode/Vuko/Notes/WZCSA07/zjetbackground.tex (file contents):
Revision 1.8 by beaucero, Sun Jun 22 23:20:41 2008 UTC vs.
Revision 1.10 by ymaravin, Mon Jun 23 16:32:05 2008 UTC

+\section{Signal extraction}
+\label{sec:SignalExt}
-<
+Two kind of background are affected this analysis: background having
-<
+already a $Z$ boson in the final state such as $Z+jets$ and
-<
+$Z+b\bar{b}$, background without a $Z$ boson such as $W+jets$ and
-<
+$t\bar{t}$. The first one will be peaking as the signal in the $Z$
-<
+mass distribution while the second should be flat.As a starting point,
-<
+we will use this properties to separate the two background.
-<
-<
+\subsection{Study of non peaking background}
-<
+In order to measure this background, a fit of the signal and
-<
+background is done. In order to fit the signal peak we use a Gaussian
-<
+convulated with a Breit-Wigner. The background is fitted by a line.
-<
+An example of the fit of the distribution composed by the sum of
-<
+signal and background for the 3 electrons final state is shown on
-<
+figure~\ref{fig:ZFit}.
->
+We separate backgrounds into two categories: one with a genuine \Z boson
->
+from $\Z+jets$ processes, and the other without a genuine \Z boson from
->
+$t\bar{t}$ and $\W+jet$ production. The latter source can be estimated from
->
+the invariant mass of the \Z boson candidate, where the background events
->
+with no genuine \Z boson should not produce a \Z mass peak and should
->
+be relatively smooth.
->
->
+\subsection{Study of the background without a genuine \Z boson}
->
+We estimate the background due to events without a genuine \Z boson from
->
+fitting a \Z candidate invariant mass to a Gaussian function convoluted
->
+with a Breit-Wigner function. The background is parameterized as a straight
->
+line. An example of a fit for $3e$ category is given in Fig.~\ref{fig:ZFit}.
->
+Both number of signal and background events are calculated for the
->
+invariant mass range between 81 and 101 GeV. In Table~\ref{tab:FitVsMC}
->
+we summarize the number of background events obtained from the fit
->
+and from the Monte Carlo truth information.
->
+\begin{figure}[!bp]
+  \begin{center}
+  \scalebox{0.4}{\includegraphics{figs/FitBkg3eTight.eps}}
-<
+  \caption{$Z$ mass distribution which contains the sum of signal and background on which a fit is performed to extract the number of non peaking background events within the 81 GeV and 101GeV.}
->
+  \caption{The invariant mass distribution of the $Z$ boson candidate that is fit to a signal
->
+  parameterized as a Gaussian function convoluted with a Breit-Wigner function and
->
+  a background, parameterized as a straight line.}
+  \label{fig:ZFit}
+  \end{center}
+\end{figure}
-–
+The comparison between the Monte Carlo information and the value
-–
+obtain by the fit for a $Z$ mass range [81,101] GeV is given in
-–
+table~\ref{tab:FitVsMC}.
-–
+\begin{table}[!tb]
+\begin{center}
-–
+\begin{tabular}{|l|c|c|c|c|c|c|c|} \hline
-<
+Channel    & $Z+jets$ & $Zb\bar{b}$ & $t\bar{t}$ & $W+jets$ & $t\bar{t}$ + $W+jets$ & Fit result \\ \hline
->
+                    & \multicolumn{2}{c|}{Background with genuine \Z} & \multicolumn{4}{c|}{Background without
->
+                    genuine \Z boson} \\
->
+Channel    & $\Z+jets$ & $\Z b\bar{b}$ &   $t\bar{t}$ & $\W+jets$ & $t\bar{t}$ + $\W+jets$ & Fit result \\ \hline
+$3e$ Loose & 196.5 & 67.4 & 35.7 & 0 & 35.7& 37.8 \\ \hline
+$3e$ Tight & 78.9 & 38.5 & 28.1 & 0 & 28.1 & 32.9 \\ \hline
+$2\mu 1e$ Loose & 189.6 & 52.6 & 4.7 & 0 & 4.7 & 5.7 \\ \hline
+\end{tabular}
+\end{center}
-<
+\caption{Comparison between monte carlo expectation for the analysis and the results of the fit for the non peaking background. Number of event are integrated between [81,101] GeV. The Loose and Tight criteria apply so far, for final state where $W\rightarrow e\nu$. One has to consider that this study as been perform on a smaller sample than the other part of the analysis a 10\% statistics error as to be counted until the study is performed on the whole samples.
->
+\caption{Comparison between Monte Carlo truth information and the results of the fit for the background without genuine \Z boson. Number of events are obtained in the invariant mass range between 81 and 101 GeV.
->
+%I AM NOT SURE I UNDERSTAND WHAT IS WRITTEN HERE
->
+%The ``Loose'' and ``Tight'' selection criteria applied for $W\rightarrow e\nu$ final state only. One has to consider that this study as been perform on a smaller sample than the other part of the analysis a 10\% statistics error as to be counted until the study is performed on the whole samples.
+\label{tab:FitVsMC}
+\end{table}
-<
+\subsection{Study of peaking background}
->
+\subsection{Estimation of the background with genuine \Z boson}
+\label{sec:D0Matrix}
-<
+The probability to misidentify a jet as a muon is very low while for
-<
+the case of the electron, $\pi^0$ in jets can be misidentified as
-<
+electrons. When considering the subtraction of the background in this
-<
+analysis, we will mainly concentrate on the final state where the $W$
-<
+is decaying to an electron. The same studies is still on going for the
-<
+muon case.
-<
-<
+\subsubsection{Z+jets background fraction}
-<
+The main background remaining even after having applied all our selection
-<
+in the case of the $W$ decaying to electron is the $Z+jets$
-<
+production. As signal and background have a \Z boson in the final
-<
+state, we will concentrate on the third lepton which is an electron in
-<
+this study.
-<
-<
+In order to select the \Z candidate in the events, we apply loose
-<
+criteria. The loose sample contain a given number of signal events
-<
+which contains a third isolated electron and a given number of
-<
+background events which do not contains a third isolated
-<
+electron. When we apply the tight criteria the fraction of signal and
-<
+background events is changing according to the efficiency of the
-<
+criteria. This can be expressed by this formula:
-<
+\begin{eqnarray}
-<
+N_{loose} & = & \hspace*{0.9cm}               N_e +   \hspace*{0.9cm}   N_{j} \\
-<
+N_{tight} & = & \epsilon_{tight} N_e  + p_{fake}  N_{j}
-<
+\end{eqnarray}
-<
+Where $N_{loose}$ and $N_{tight}$ are the numbers of events in
-<
+the loose and tight samples, respectively, $N_e$ is the number of events with a third
-<
+isolated electron, $N_j$ is the number of events without a third
-<
+isolated electron, $\epsilon_{tight}$ is the efficiency of the tight
-<
+criteria on electron, $p_{fake}$ is the probability for a jet identified
-<
+as a loose electron to be also identified as a tight electron.  By
-<
+solving this set of equations we obtain:
-<
+$$
-<
+N_e     = \frac{N_{tight}-p_{fake} N_{loose}} { \epsilon_{tight} -p_{fake}} \ \ \ \mbox{and} \ \ \
-<
+N_{j} = \frac{ \epsilon_{tight} N_{loose} - N_e}{  \epsilon_{tight} -p_{fake}}
-<
+$$
-<
-<
+The estimation of $\epsilon_{tight}$ and $p_{fake}$ can be done using control
-<
+samples, containing electrons and jet respectively with high purity. The Tag and probe
-<
+method is used to determine signal efficiency $\epsilon_{tight}$. The rate
-<
+will be derived from $Z \to e^+e^-$ samples. In the following section we
-<
+describe the method used to determine $p_{fake}$.
->
+All of the instrumental background events with real \Z boson come from
->
+$\Z + jets$ processes where one of the jets is misidentified as a lepton.
->
+The probability to misidentify a jet as a muon is very low in CMS, while that
->
+for the case of electron can be quite high as jets with large electromagnetic
->
+energy fraction can be misidentified as electrons. $\Z+jet$ background
->
+is especially high for \WZ\ signal with $\W\to e\nu$. Thus, it is imperative
->
+to have a reliable estimation of this background from data to avoid
->
+unnecessary systematic uncertainties due to Monte Carlo description of data
->
+in startup conditions. Therefore, in the following we describe the data-driven
->
+estimation of the $\Z+jets$ background for the $\ell^+\ell^- e$
->
+categories. A similar study for the remaining $\ell^+\ell^- \mu$ categories
->
+are in progress. However, as the $\Z+jets$ background is sufficiently small, it is
->
+possible to use Monte Carlo simulation to estimate $\Z+jet$ background with
->
+early data, without incurring significant systematic uncertainty due to data modeling.
->
->
+\subsubsection{$\Z+jets$ background fraction}
->
+To estimate the fraction of the $\Z+jets$ events in data
->
+we apply a method, commonly referred to as ``matrix'' method.
->
+The idea of a method is to apply ``Loose'' identification criteria
->
+on the third lepton after \Z boson candidate is identified
->
+and count the number of the observed events, $N_{loose}$.
->
+These events contain events with real electrons $N_{e}$
->
+and events with misidentified jets $N_j$:
->
+\begin{equation}
->
+\label{eq:matrixEq1}
->
+N_{loose} = N_e + N_j.
->
+\end{equation}
->
->
+If we are to apply ``Tight'' selection on the third lepton, the number
->
+of the observed events $N_{tight}$ would change as following
->
+\begin{equation}
->
+\label{eq:matrixEq2}
->
+N_{tight} = \epsilon_{tight} N_e + p_{fake} N_j,
->
+\end{equation}
->
+where $\epsilon_{tight}$ and $p_{fake}$ are efficiency of ``Tight''
->
+criteria with respect to ``Loose'' requirements for electrons and
->
+misidentified jets, respectively. As $N_{loose}$ and $N_{tight}$
->
+are directly observable, to extract the number of $Z+jet$ events
->
+in the final sample, one needs to measure $\epsilon_{tight}$
->
+and $p_{fake}$ in control data samples. Two possible ways
->
+to estimate these values are given below.
->
->
+\subsubsection{Determination of $\epsilon_{tight}$}
->
->
+\begin{figure}[bt]
->
+  \begin{center}
->
+  \scalebox{0.8}{\includegraphics{figs/tag_probe_fit.eps}}
->
+  \caption{Invariant mass of the \Z boson candidate for ``Tight-Tight'' (a)
->
+  and ``Tight-Loose'' (b) electron selections fitted to a Gaussian with
->
+  bifurcated Breit-Wigner functions.}
->
+  \label{fig:tagprobe}
->
+  \end{center}
->
+\end{figure}
-+
+To estimate the $\epsilon_{tight}$ we apply ``tag-and-probe'' method
-+
+using $\Z \to e^+e^-$ from \Z+jets Chowder sample, including  \W+jets
-+
+and $t\bar{t}$ as background. \Z mass distribution is separated for two cases where
-+
+electrons from \Z decay either both pass ``Tigh'' selection (``Tight-Tight'' case), or only
-+
+one passes the ``Tight'' selection, while the other electron passes ``Loose'' but not ``Tight''
-+
+selection (``Tight-Loose'' case). To estimate signal in the selected \Z candidate invariant mass distribution, we fit it to a Gaussisan with bifurcated Breit-Wigner function as a signal
-+
+and straight line for a background model. \Z mass distribution and fit are shown in \ref{fig:tagprobe}.
-+
-+
+Equation for determination of signal efficiency is given as
-+
+\begin{equation}
-+
+\epsilon_{tight}=\frac{ 2*(N_{TT}-B_{TT}) }{ (N_{TL}-B_{TL})+2*(N_{TT}-B_{TT}) }
-+
+\end{equation}
-+
-+
+where $N_{TT}$,$B_{TT}$,$N_{TL}$ and $B_{TL}$ are, respectively, number of signal+background
-+
+and background events for ``Tight-Tight'' and ``Loose-Tight'' electron combinations.
-+
+We estimated an efficiency $\epsilon_{tight}=0.99 \pm  0.01$.
-+
+\subsubsection{Determination of $p_{fake}$}
+As the events will be most of the time triggered by the leptons coming
-<
+from \Z boson, we assume that the third lepton is unbiased toward
-<
+trigger requirement. Ideally we need a sample of pure multi-jet events
->
+from \Z boson, we assume that the third lepton is unbiased toward the
->
+trigger requirement. Ideally, we need a sample of pure multi-jet events
+in order to compute the probability for a jet identified as a loose
+electron to be also identified as a tight electron. In selecting such
+a sample in data, one has to avoid any bias from the trigger
-<
+requirements on the loose electron candidate.
->
+requirements on the ``Loose'' electron candidate.
+%Such sample will not
+%exist in data as they will be bias by the trigger requirement.
+\begin{figure}[bt]
+  \begin{center}
+  \scalebox{0.6}{\includegraphics{figs/tight_eff_gumbo.eps}}
-<
+  \caption{Fraction of electron candidates passing the tight criteria
-<
+    in QCD event. No trigger requirement has been applied.}
->
+  \caption{Fraction of electron candidates passing the ``Tight'' criteria
->
+    in multijet event. No trigger requirement has been applied.}
+  \label{fig:qcd_efftight_noHLT}
+  \end{center}
+\end{figure}
-<
+From a sample of multi-jet events triggered by an ``OR'' of multi-jet
-<
+triggers, we will select loose electron candidates that are not
-<
+matched to any of the triggering object
-<
+%we will reject the object matched with the triggering
-<
+%objects. This will allow us to have a unbiased sample of multi-jet
-<
+%events. For the purpose of the study, we have used CSA07 Gumbo
-<
+%samples, with Pythia ID filtering in order to keep only events from
-<
+%photon+jets, QCD and minimum bias events.
-<
+The removal of the object matched with the triggering object is done
-<
+using a matching cone of $\Delta R =0.2$. "Simple Loose" selection is
-<
+applied to each reconstructed electron from this sample of jets from
-<
+QCD and photon+jet. Then the tight criteria is applied on such loose
-<
+electrons and the $p_{fake}$ is simply the ratio of this two
-<
+population. This ratio, given as a function of $Pt$ and $\eta$, is
-<
+showed in plots \ref{fig:qcd_zjet_est}.
->
+From a sample of multijet events triggered by an ``OR'' of multi-jet
->
+triggers, we select a ``Loose'' electron candidate that are not
->
+matched to any of the trigger objects. We also require the
->
+electron candidate to be separated from the jet that satisfies
->
+the trigger requirement by requiring the candidate to be separated
->
+by at least $\Delta R = 0.2$ from the trigger object.
->
+This allows us to obtain an unbiased sample of multijet events
->
+where an electron candidate is likely to be either a converted
->
+photon or a misidentified jet. The $p_{fake}$ function of $p_T$
->
+and $\eta$ is simply obtained by dividing the $p_T$ and $\eta$
->
+distributions for the electron candidate that satisfied ``Simple Tight''
->
+electron identification requirements to that for electron candidates
->
+that satisfied ``Simple Loose''. Such distributions are given
->
+in Fig.~\ref{fig:qcd_zjet_est}.
->
-–
+\subsubsection{Determination of $\epsilon_{tight}$}
-–
+TO BE WRITTEN... SRECKO???

Diff Legend

-–
+Removed lines
-+
+Added lines
-<
+Changed lines
->
+Changed lines

Comparing UserCode/Vuko/Notes/WZCSA07/zjetbackground.tex (file contents): Revision 1.8 by beaucero, Sun Jun 22 23:20:41 2008 UTC vs. Revision 1.10 by ymaravin, Mon Jun 23 16:32:05 2008 UTC

Diff Legend

Comparing UserCode/Vuko/Notes/WZCSA07/zjetbackground.tex (file contents):
Revision 1.8 by beaucero, Sun Jun 22 23:20:41 2008 UTC vs.
Revision 1.10 by ymaravin, Mon Jun 23 16:32:05 2008 UTC