MitHzz4l/Documentation/Backgrounds.save.tex

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
\section{Backgrounds}\label{section:BG}
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
This section reviews our estimation of background in the $4\ell$ analysis.  We discuss expected yields and the predicted $m(4\ell)$ shapes for the various background channels, both of which are used in the limit calculation described in Section~\ref{sec:limit}.  Irreducible EWK backgrounds are estimated with Monte Carlo.  Our estimates of ``fake'' backgrounds associated with $Z+jet$ and heavy flavor backgrounds from $Zb\bar{b}/c\bar{c}$ and $t\bar{t}$ are data-driven.

%_________________________________________________________________
\subsection{Electroweak Backgrounds}\label{sec:EWK}
%_________________________________________________________________
We use the $ZZ \rightarrow 4\ell$, $WZ \rightarrow 3\ell$ and $Z\gamma$ MC samples listed in Table~\ref{tab:MC} to estimate the yields and shapes of these backgrounds.  Each of the MC acceptances ($\alpha$) we determine from the MC are efficiency-corrected following the same procedure described for signal in Section~\ref{section:signalEff}.  This provides a corrected $\alpha_{c}$ for each process that we use to predict yields : 

\begin{eqnarray}
N^{exp}_{i}  & = & \alpha^{c}_{i}\int\mathcal{L}(k_{i}\sigma_{i})
\end{eqnarray}

The cross sections and K-factors used in formula above are taken from Table~\ref{tab:xsec}.  Table~\ref{tab;MCBG} lists the corrected acceptances and expected yields for each of the irreducible backgrounds.  Figure~\ref{fig:MCshapes} shows a yield-normalized histogram stack of the corresponding $m(4\ell)$ distributions.

%-------------------------------------------------
\begin{table}[tbh]
\begin{center}
\begin{tabular}{|c|c|c|}
\hline
process         & $\alpha^{c}$    & $N^{exp}$ \\
\hline
$ZZ*$           &  ~               & ~ \\ 
$WZ$            &  ~               & ~ \\ 
$Z\gamma$       &  ~               & ~ \\ 
\hline
\end{tabular}
\caption{{\bf MC Background Yields.}\small{blah.}\label{tab:MCBG}}
\end{center}
\end{table}
%-------------------------------------------------

%-------------------------------------------------
\begin{figure}[tbp]
\begin{center}
\includegraphics[width=0.5\linewidth]{figs/HF1.png}
\caption{MC Background Shapes.\label{fig:MCshapes} }
\end{center}
\end{figure}
%-------------------------------------------------

Our current estimate of $WZ$ background is entirely MC-based.  In addition to the leptons from $W$ and $Z$ decay, an additional ``fake'' lepton is needed for this process to contribute in the $4\ell$ signal region.  Given the difficulty in simulating processes that result in fake leptons, we perform an additional data-driven estimate of background using the ``fakeable object'' method described below.  The method is described in Appendix~\ref{app:WZfake}.  Results from the procedure given in Table~\ref{tab:WZfake}.  The data-driven estimates are consistent with MC predictions within uncertainties.  

%-------------------------------------------------
\begin{table}[tbh]
\begin{center}
\begin{tabular}{|c|c|c|}
\hline
$4e$    &         $4\mu$  &       $2e2\mu$ \\
\hline
$X\pm Y$ &         $Z\pm Y$ &       $Z\pm Y$  \\
\hline
\end{tabular}
\caption{{\bf Data-driven Expected $WZ$ Yields.}\small{blah.}\label{tab:WZfake}}
\end{center}
\end{table}
%-------------------------------------------------


%_________________________________________________________________
\subsection{Instrumental/Fake Backgrounds}\label{sec:Zj}
%_________________________________________________________________
We estimate $Z + jets$, $t\bar{t}$ and $Zb\bar{b}/c\bar{c}$  backgrounds using the ``fakeable object'' technique~\cite{fakeable}.  We define fakerates with respect to loosely identified lepton candidates, which are henceforth referred to as {\it denominator objects}.  Our electron and muon denominator definitions are given in Table~\ref{tab:fo}.

%-------------------------------------------------
\begin{table}[tbh]
\begin{center}
\begin{tabular}{|c|c|c|c|}
\hline
\multicolumn{2}{|c|}{Electron}                    & \multicolumn{2}{|c|}{Muon}  \\
\hline
variable               & requirement              & variable   & requirement          \\
\hline
~ &         ~  &                                  $p_{T}$                & $> 5\rm~GeV$            \\
~ &         ~  &                                  type                   & $\rm Global~||~Tracker$  \\
~ &         ~  &                                  $|d_{0}|$              & $< 2\rm~cm$             \\
~ &         ~  &                                  $\sigma(p_{T})/p_{T}$  & $< 0.4$                 \\
~ &         ~  &                                  $Iso^{pf}_{0.3}$       & $< 0.7$                 \\
\hline
\end{tabular}
\caption{{\bf Denominator Object Definitions.}\small{blah.}\label{tab:fo}}
\end{center}
\end{table}
%-------------------------------------------------

We calculate fakerates ($\epsilon_{FR}(p_{T},\eta)$) from samples of events that pass a combination of single and double lepton triggers (XXX for electrons, \verb|HLT_Mu8| or \verb|HLT_Mu13| for muons).  In both cases we veto events with $MET > 20\rm~GeV$ or $m_{T} > 35\rm~GeV$ or that contain two or more denominator objects with $p_{T} > 10\rm~GeV$ to reduce contributions from W's and Z's.  The samples are enriched in background by selecting only those denominator objects opposite ($\Delta R(\eta,\phi) > 1.0$) a reconstructed $p_{T} > 35\rm~GeV$ jet.  Figure~\ref{fig:FR} shows the electron and muon fakerates determined by this procedure as a function of $p_{T}$.  

%-------------------------------------------------
\begin{figure}[tbp]
\begin{center}
\includegraphics[width=0.45\linewidth]{figs/frMu.png}
\includegraphics[width=0.45\linewidth]{figs/HF1.png}
\caption{ {\bf Muon and Electron Fake Rates.}\label{fig:FR} }
\end{center}
\end{figure}
%-------------------------------------------------

We estimate the contribution of the $\ell\elljj$ backgrounds in our signal region by applying the fakerates in events that contain a good Z1.  We select denominator objects that fail our full lepton identification/isolation selection to prevent bias from real leptons.  We then loop over pairs of denominator objects, weight each leg with $\epsilon_{FR}(p_{T},\eta)/(1-\epsilon_{FR}(p_{T},\eta))$  and apply the Z2 kinematic requirements ($12\rm~GeV < m(Z2) < 120$) on the pair.  The denominator in the weight term accounts for the fact the we only consider denominator objects that fail full lepton selection.  The weighted pairs that pass the Z2 kinematic selection are summed to provide an estimate of the $\ell\ell\jj$ background.  Table~\ref{tab:fakes} lists the number of Z1 + $2\times~fakes$ we determine for the $1.6~\rm fb{-1}$ dataset.  

%\begin{eqnarray}
% \Sigma_{i=0}^{Nd}~\Sigma_{j=i+1}^{Nd}~\frac{\epsilon(\eta^{i},p_{T}^{i})}{1-\epsilon(\eta^{i},p_{T}^{i})}~\frac{\epsilon(\eta^{j},p_{T}^{j})}{1-\epsilon(\eta^{j},p_{T}^{j})} &=&  N(Z+j) 
%\end{eqnarray}

%-------------------------------------------------
\begin{table}[tbh]
\begin{center}
\begin{tabular}{|c|c|}
\hline
\multicolumn{2}{|c|}{Z1-Inclusive $Z+jets$ Yields} \\
\hline
$Z1 + \mu\mu$      & $0.057 \pm X$ \\
$Z1 + ee$          & $X \pm Y$     \\
\hline
\multicolumn{2}{|c|}{Final-State $Z+jets$ Yields} \\
\hline 
$4\mu$      & $0.044 \pm X$ \\
$4e$        & $X \pm Y$     \\
$2e2\mu$    & $(0.013 + Z) \pm Y$     \\
\hline
\end{tabular}
\caption{{\bf Expected Fakes.}\label{tab:fakes}}
\end{center}
\end{table}
%-------------------------------------------------


%In the second calculation, we require three fully selected leptons (two from the Z1 plus one additional) and then perform a single loop that associates the denominator objects with additional lepton.  As before, we weight the demoninators with $\epsilon_{FR}(p_{T},\eta)$, apply kinematic selection and sum.  The additional lepton with which the denominators are paired is either a fake (Z+jets) or real (WZ+jets) lepton.  In the case where the additional lepton is fake over-represented because each jet leg has a probability of $\epsilon_{FR}$ to passing the tight selection.to jets The sum includes both contributions, counting picks a  This sum double counts the contributions from are summed    in our signal region to 

%\begin{eqnarray}
% \ell~\Sigma_{i=0}^{Nd}~\epsilon(\eta^{i},p_{T}^{i}) &=&  2\times N(Z+j) +  N(WZ) 
%\end{eqnarray}

It is difficult to obtain a reliable estimate of the $Z + jets$ $m(4\ell)$ distribution with the limited number of events containing a good Z1 and two denominator objects.  We loosen the denominator and Z2 selections to better understand the shape.  For the muon-channel, we relax the isolation requirement in the denominator definition and remove the opposite-sign requirement in Z2 selection.  For electrons we {\bf XXX}.  With these modifications we obtain the $m(4\ell)$ distributions shown Figure~\ref{fig:fakeshapes}.  We fit the shapes with Landau distributions and obtain acceptable values for goodness-of-fit.  Consequently, we also choose Landau distributions for modeling the $m(4l)$ distribution of our $Z+jets$ predictions, as is shown in Figure~\ref{fig:fakeshapes}.  We take differences with respect to the high-statistics distributions as a systematic uncertainties on our predicted shapes.

%-------------------------------------------------
\begin{figure}[tbp]
\begin{center}
\includegraphics[width=0.45\linewidth]{figs/muFakeShape-4m.png}
\includegraphics[width=0.45\linewidth]{figs/muFakeShape-2m.png}
\includegraphics[width=0.45\linewidth]{figs/muFakeShape-4m.png}
\caption{$\gamma_d$ Branching Ratios.\label{fig:fakeshapes} }
\end{center}
\end{figure}
%-------------------------------------------------

\subsection{Cross Check : Light Flavor }

We cross-check our fakerates and procedures by attempting to predict the number of fakes contributing in an independent control region.  We require one lepton candidate that passes our nominal lepton selection and $1+$ opposite-sign, same-flavor denominator objects.  In the muon-channel, this selection provides a sample of nearly pure background, of which the primary contribution is $W+jet$ where the jet fakes a lepton.  For electrons, charge misidentification is significant enough to result in a noticeable Z-peak.  However, the contribution of fakes in the electron sample is easily estimated from a fit with a same-sign MC Z template and an exponential  background PDF.  Events selected in data are shown in Figures~\ref{fig:ssMuon} and (\ref{fig:ssEle}) as points.  Table~\ref{tab:ssfakes} lists the total number of observed events in the muon-channel and the electron-channel background estimated from the fit.

%-------------------------------------------------
\begin{figure}[tbp]
\begin{center}
\includegraphics[width=0.45\linewidth]{figs/ssMuon-m4l.png}
\includegraphics[width=0.45\linewidth]{figs/ssMuon-pt.png}
\caption{{\bf Fakerate validation with same-sign muon events.}\small{blah.}\label{fig:ssMuon} }
\end{center}
\end{figure}
%-------------------------------------------------

%-------------------------------------------------
\begin{figure}[tbp]
\begin{center}
\includegraphics[width=0.45\linewidth]{figs/ssMuon-m4l.png}
\includegraphics[width=0.45\linewidth]{figs/ssMuon-pt.png}
\caption{{\bf Fakerate validation with same-sign electron events.}\small{blah.}\label{fig:ssEle} }
\end{center}
\end{figure}
%-------------------------------------------------

Next, we predict the number of same-sign fake events in these samples by applying our fakerates to the denominator objects.  We loop over all denominator objects in these events that fail full lepton selection, weight each with the appropriate factor of $\epsilon_{FR}(p_{T},\eta)/(1-\epsilon_{FR}(p_{T},\eta))$ and sum.  The expected $m(\ell\ell)$ and $p_{T}$ distributions we obtain from this procedure are shown as solid histograms in Figures~\ref{fig:ssMuon} and ~\ref{fig:ssEle}.  Predicted yields are compared with observation in Table~\ref{tab:ssfakes}.  The shape of the observed $m(\ell\ell)$ distribution is well described by our prediction.  The normalizations agree to within $6\%$ (muons) and $X\%$ (electrons).  {\bf Shouldn't take every cross check as a systematic ... but should we take this one?  Assign it to sample depend somehow?}.

%-------------------------------------------------
\begin{table}[tbh]
\begin{center}
\begin{tabular}{|c||c|c|}
\hline
channel                 & observed & predicted      \\
\hline
\hline
$same sign~\mu\mu$      & $1741$    & $1642.6 \pm X$  \\
$same sign~ee$          & $X$      & $Z \pm Y$  \\
\hline
\end{tabular}
\caption{{\bf Same-sign Control Yields.}\label{tab:ssfakes}}
\end{center}
\end{table}
%-------------------------------------------------

\subsection{Cross Check : Heavy Flavor }

\subsection{Cross Check : W/Z  }

%_________________________________________________________________
\subsection{$t\bar{t}$, $Zb\bar{b}$, $Zc\bar{c}$}\label{sec:HF}
%_________________________________________________________________
Background from $t\bar{t}$ and $Zb\bar{b}/c\bar{c}$ involves real leptons from heavy flavor decays.  While lepton multiplicity, decay length and kinematics are correctly simulated ({\bf reference?}), we can not rely on MC to accurately model the identification and isolation efficiency of leptons embedded in heavy flavor jets.  We address this by determining a set of data-driven efficiency scale factors, $f_{HF}$, that we use these to correct MC predictions for $t\bar{t}$ and $Zb\bar{b}$, $Zc\bar{c}$ 4-lepton backgrounds.  

We calculate the efficiency scale factors using simulated $Zb\bar{b}$ and $t\bar{t}$ samples and a large-IP control region in data.  The control region consists of events that contain a pair of preselected leptons passing our Z1 selection and at least two additional reconstructed ({\it i.e.:} denominator) lepton candidates with $\sigma_{IP} > 5$.  We do not require the denominators to be opposite-charge, same-flavor or isolated.  

The left plot of Figure~\ref{fig:ZHF} compares the predicted and observed $m(Z1)$ distributions from the control regions in data and MC for events passing the denominator selection described above.  Both shape and normalization agree; we observe $98$ events and predict $91.1 \pm XXX$.  We then require the high-IP lepton candidates to additionally pass the more stringent lepton ID and isolation criteria used in our nominal Z2 selection.  We predict $0.80 \pm XXX$ events and observe 2.  Table~\ref{tab:HFSF} lists the predicted and observed number of individual Z2 muon/electron candidates that pass the denominator and Z2 lepton selections\footnote{Note that values in the ``pass'' column are calculated for {\it single} leptons, not pairs.}  We calculate per-lepton efficiencies from the fraction of denominators that pass the Z2 requirements.  Per-lepton efficiency scale factors $f_{HF}$ are determined from the ratio of data and MC efficiencies. 

%-------------------------------------------------
\begin{figure}[tbp]
\begin{center}
\includegraphics[width=0.45\linewidth]{figs/HF1.png}
\includegraphics[width=0.45\linewidth]{figs/HF1.png}
\caption{$\gamma_d$ Branching Ratios.\label{fig:ZHF} }
\end{center}
\end{figure}
%-------------------------------------------------

%-------------------------------------------------
\begin{table}[tbh]
\begin{center}
\begin{tabular}{|c|ccc|ccc|c|}
\hline
~ & \multicolumn{3}{|c|}{Monte Carlo}  & \multicolumn{3}{|c|}{Data} & ~                           \\
\hline
type     & denom    & pass   & $\epsilon~(\%)$  & denom & pass & $\epsilon~(\%)$   & $f_{HF}$     \\
\hline
$e$      & $34.0$   & $1.9$  &  $5.6 \pm 3.9$   & $43$  & $4$  &  $9.3 \pm 4.4$    & $1.67 \pm X$  \\
$\mu$    & $148.3$  & $8.0$  &  $5.4 \pm 1.9$   & $153$ & $12$ &  $7.8 \pm 2.2$    & $1.46 \rm X$  \\
\hline
\end{tabular}
\caption{Muon Isolation.\label{tab:HFSF}}
\end{center}
\end{table}
%-------------------------------------------------

To a large extent lepton isolation and identification are uncorrelated with impact parameter.  Therefore, we can use $f_{HF}$ to correct MC predictions for the number of heavy flavor events with low-IP leptons that pass our full event selection.  First, we select events in MC with a good Z1 and two $\sigma(IP) < 4$ lepton candidates satisfying the denominator requirements given above.  Next, the candidates are required to be opposite-charge and same flavor with $m(Z2) > 12\rm~GeV$, consistent with a good Z2.  Estimates of heavy flavor background can be simply obtained by applying full lepton identification and isolation requirements to the denominator leptons.  To reduce statistical uncertainties, we categorize the predictions according to Z2 lepton flavor and ignore the flavor of Z1.  These results are given in the first column of Table~\ref{tab:HFyield}.  

%-------------------------------------------------
\begin{table}[tbh]
\begin{center}
\begin{tabular}{|c|c|c|}
\hline
channel       & uncorrected      & corrected       \\
\hline
$ee$          & $0.130 \pm 0.X$  & $0.364 \pm 0.X$ \\
$\mu\mu$      & $0.095 \pm 0.X$  & $0.203 \pm 0.X$ \\
\hline
\end{tabular}
\caption{(Un)Corrected Z2 Yields.\label{tab:HFyield}}
\end{center}
\end{table}
%-------------------------------------------------

Assuming an equal number of electron and muon-channel Z1 decays and accounting for data/MC efficiency differences appropriate for each Z1 flavor ({\bf do this!!!}), we obtain the uncorrected predictions for the individual $4e$, $\mu4$ and $2e2\mu$ channels listed in Table~\ref{tab:HFyield2}.  To obtain corrected predictions, we weight the total Z2 yields with two powers of the relevant $f_{HF}$ and repeat the procedure.  Corrected $4e$, $\mu4$ and $2e2\mu$ yields are listed in the second column of Table~\ref{tab:HFyield2}.  

%-------------------------------------------------
\begin{table}[tbh]
\begin{center}
\begin{tabular}{|c|c|c|}
\hline
channel     & uncorrected      & corrected \\
\hline
$4e$        & $0.065 \pm 0.X$  & $0.182 \pm 0.X$   \\
$4\mu$      & $0.047 \pm 0.X$  & $0.102 \pm 0.X$   \\
$2e2\mu$    & $0.113 \pm 0.X$  & $0.295 \pm 0.X$   \\
\hline                                         
total       & $0.225 \pm 0.X$  & $0.579 \pm 0.X$   \\
\hline
\end{tabular}
\caption{(Un)Corrected Heavy Flavor Yields.\label{tab:HFyield2}}
\end{center}
\end{table}
%-------------------------------------------------

We determine a shape for heavy flavor background in the signal region from the distribution of $m(4\ell)$ in the high-IP control region.  Neither data nor MC contain a sufficient number events with Z2 leptons that pass ID and isolation for us to directly predict the shape, so we instead take the Z1 + $2\times$ denominator shape from data as a template.  Figure~\ref{fig:HFshape} compares the $m(4\ell)$ distribution for this selection in data and MC.  The figure MC distributions are  cross-section normalized and correspond to the $91.1$ events mentioned above.  The leftmost plot shows that the predicted $m(4\ell)$ shape is not a good representation of the shape in data.  The observed difference is consistent with a simulated energy scale that is too small for electrons below $20~\rm GeV$ (see Figure~\ref{fig:HFshapeScaled}).  We choose the shape from data as our signal region template and assign a systematic shape uncertainties based on the observed deviation with the MC shape.

%-------------------------------------------------
\begin{figure}[tbp]
\begin{center}
\includegraphics[width=0.5\linewidth]{figs/m4l-HF.png}
\caption{Heavy Flavor $m(4\ell)$ Shape.\label{fig:HFshape} }
\end{center}
\end{figure}
%-------------------------------------------------

%-------------------------------------------------
\begin{figure}[tbp]
\begin{center}
\includegraphics[width=0.5\linewidth]{figs/m4l-HF.png}
\caption{Heavy Flavor $m(4\ell)$ Shape.\label{fig:HFshapeScaled} }
\end{center}
\end{figure}
%-------------------------------------------------
Revision:	1.2
Committed:	Tue Nov 8 11:18:58 2011 UTC (13 years, 6 months ago) by khahn
Content type:	application/x-tex
Branch:	MAIN
CVS Tags:	compiled, synced_FSR_2, synced_FSR, synched2, synched, AN490, HEAD
Changes since 1.1:	+10 -10 lines
Log Message:	stuff corresponding to rough draft
#	User	Rev	Content
1	khahn	1.1	%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
2			\section{Backgrounds}\label{section:BG}
3			%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
4			This section reviews our estimation of background in the $4\ell$ analysis. We discuss expected yields and the predicted $m(4\ell)$ shapes for the various background channels, both of which are used in the limit calculation described in Section~\ref{sec:limit}. Irreducible EWK backgrounds are estimated with Monte Carlo. Our estimates of ``fake'' backgrounds associated with $Z+jet$ and heavy flavor backgrounds from $Zb\bar{b}/c\bar{c}$ and $t\bar{t}$ are data-driven.
5
6			%_________________________________________________________________
7			\subsection{Electroweak Backgrounds}\label{sec:EWK}
8			%_________________________________________________________________
9			We use the $ZZ \rightarrow 4\ell$, $WZ \rightarrow 3\ell$ and $Z\gamma$ MC samples listed in Table~\ref{tab:MC} to estimate the yields and shapes of these backgrounds. Each of the MC acceptances ($\alpha$) we determine from the MC are efficiency-corrected following the same procedure described for signal in Section~\ref{section:signalEff}. This provides a corrected $\alpha_{c}$ for each process that we use to predict yields :
10
11			\begin{eqnarray}
12			N^{exp}_{i} & = & \alpha^{c}_{i}\int\mathcal{L}(k_{i}\sigma_{i})
13			\end{eqnarray}
14
15	khahn	1.2	The cross sections and K-factors used in formula above are taken from Table~\ref{tab:xsec}. Table~\ref{tab;MCBG} lists the corrected acceptances and expected yields for each of the irreducible backgrounds. Figure~\ref{fig:MCshapes} shows a yield-normalized histogram stack of the corresponding $m(4\ell)$ distributions.
16	khahn	1.1
17			%-------------------------------------------------
18			\begin{table}[tbh]
19			\begin{center}
20			\begin{tabular}{\|c\|c\|c\|}
21			\hline
22			process & $\alpha^{c}$ & $N^{exp}$ \\
23			\hline
24			$ZZ*$ & ~ & ~ \\
25			$WZ$ & ~ & ~ \\
26			$Z\gamma$ & ~ & ~ \\
27			\hline
28			\end{tabular}
29			\caption{{\bf MC Background Yields.}\small{blah.}\label{tab:MCBG}}
30			\end{center}
31			\end{table}
32			%-------------------------------------------------
33
34			%-------------------------------------------------
35			\begin{figure}[tbp]
36			\begin{center}
37			\includegraphics[width=0.5\linewidth]{figs/HF1.png}
38			\caption{MC Background Shapes.\label{fig:MCshapes} }
39			\end{center}
40			\end{figure}
41			%-------------------------------------------------
42
43	khahn	1.2	Our current estimate of $WZ$ background is entirely MC-based. In addition to the leptons from $W$ and $Z$ decay, an additional ``fake'' lepton is needed for this process to contribute in the $4\ell$ signal region. Given the difficulty in simulating processes that result in fake leptons, we perform an additional data-driven estimate of background using the ``fakeable object'' method described below. The method is described in Appendix~\ref{app:WZfake}. Results from the procedure given in Table~\ref{tab:WZfake}. The data-driven estimates are consistent with MC predictions within uncertainties.
44	khahn	1.1
45			%-------------------------------------------------
46			\begin{table}[tbh]
47			\begin{center}
48			\begin{tabular}{\|c\|c\|c\|}
49			\hline
50			$4e$ & $4\mu$ & $2e2\mu$ \\
51			\hline
52			$X\pm Y$ & $Z\pm Y$ & $Z\pm Y$ \\
53			\hline
54			\end{tabular}
55			\caption{{\bf Data-driven Expected $WZ$ Yields.}\small{blah.}\label{tab:WZfake}}
56			\end{center}
57			\end{table}
58			%-------------------------------------------------
59
60
61			%_________________________________________________________________
62			\subsection{Instrumental/Fake Backgrounds}\label{sec:Zj}
63			%_________________________________________________________________
64			We estimate $Z + jets$, $t\bar{t}$ and $Zb\bar{b}/c\bar{c}$ backgrounds using the ``fakeable object'' technique~\cite{fakeable}. We define fakerates with respect to loosely identified lepton candidates, which are henceforth referred to as {\it denominator objects}. Our electron and muon denominator definitions are given in Table~\ref{tab:fo}.
65
66			%-------------------------------------------------
67			\begin{table}[tbh]
68			\begin{center}
69			\begin{tabular}{\|c\|c\|c\|c\|}
70			\hline
71			\multicolumn{2}{\|c\|}{Electron} & \multicolumn{2}{\|c\|}{Muon} \\
72			\hline
73			variable & requirement & variable & requirement \\
74			\hline
75			~ & ~ & $p_{T}$ & $> 5\rm~GeV$ \\
76			~ & ~ & type & $\rm Global~\|\|~Tracker$ \\
77			~ & ~ & $\|d_{0}\|$ & $< 2\rm~cm$ \\
78			~ & ~ & $\sigma(p_{T})/p_{T}$ & $< 0.4$ \\
79			~ & ~ & $Iso^{pf}_{0.3}$ & $< 0.7$ \\
80			\hline
81			\end{tabular}
82			\caption{{\bf Denominator Object Definitions.}\small{blah.}\label{tab:fo}}
83			\end{center}
84			\end{table}
85			%-------------------------------------------------
86
87			We calculate fakerates ($\epsilon_{FR}(p_{T},\eta)$) from samples of events that pass a combination of single and double lepton triggers (XXX for electrons, \verb\|HLT_Mu8\| or \verb\|HLT_Mu13\| for muons). In both cases we veto events with $MET > 20\rm~GeV$ or $m_{T} > 35\rm~GeV$ or that contain two or more denominator objects with $p_{T} > 10\rm~GeV$ to reduce contributions from W's and Z's. The samples are enriched in background by selecting only those denominator objects opposite ($\Delta R(\eta,\phi) > 1.0$) a reconstructed $p_{T} > 35\rm~GeV$ jet. Figure~\ref{fig:FR} shows the electron and muon fakerates determined by this procedure as a function of $p_{T}$.
88
89			%-------------------------------------------------
90			\begin{figure}[tbp]
91			\begin{center}
92			\includegraphics[width=0.45\linewidth]{figs/frMu.png}
93			\includegraphics[width=0.45\linewidth]{figs/HF1.png}
94			\caption{ {\bf Muon and Electron Fake Rates.}\label{fig:FR} }
95			\end{center}
96			\end{figure}
97			%-------------------------------------------------
98
99	khahn	1.2	We estimate the contribution of the $\ell\elljj$ backgrounds in our signal region by applying the fakerates in events that contain a good Z1. We select denominator objects that fail our full lepton identification/isolation selection to prevent bias from real leptons. We then loop over pairs of denominator objects, weight each leg with $\epsilon_{FR}(p_{T},\eta)/(1-\epsilon_{FR}(p_{T},\eta))$ and apply the Z2 kinematic requirements ($12\rm~GeV < m(Z2) < 120$) on the pair. The denominator in the weight term accounts for the fact the we only consider denominator objects that fail full lepton selection. The weighted pairs that pass the Z2 kinematic selection are summed to provide an estimate of the $\ell\ell\jj$ background. Table~\ref{tab:fakes} lists the number of Z1 + $2\times~fakes$ we determine for the $1.6~\rm fb{-1}$ dataset.
100	khahn	1.1
101			%\begin{eqnarray}
102			% \Sigma_{i=0}^{Nd}~\Sigma_{j=i+1}^{Nd}~\frac{\epsilon(\eta^{i},p_{T}^{i})}{1-\epsilon(\eta^{i},p_{T}^{i})}~\frac{\epsilon(\eta^{j},p_{T}^{j})}{1-\epsilon(\eta^{j},p_{T}^{j})} &=& N(Z+j)
103			%\end{eqnarray}
104
105			%-------------------------------------------------
106			\begin{table}[tbh]
107			\begin{center}
108			\begin{tabular}{\|c\|c\|}
109			\hline
110			\multicolumn{2}{\|c\|}{Z1-Inclusive $Z+jets$ Yields} \\
111			\hline
112			$Z1 + \mu\mu$ & $0.057 \pm X$ \\
113			$Z1 + ee$ & $X \pm Y$ \\
114			\hline
115			\multicolumn{2}{\|c\|}{Final-State $Z+jets$ Yields} \\
116			\hline
117			$4\mu$ & $0.044 \pm X$ \\
118			$4e$ & $X \pm Y$ \\
119			$2e2\mu$ & $(0.013 + Z) \pm Y$ \\
120			\hline
121			\end{tabular}
122			\caption{{\bf Expected Fakes.}\label{tab:fakes}}
123			\end{center}
124			\end{table}
125			%-------------------------------------------------
126
127
128			%In the second calculation, we require three fully selected leptons (two from the Z1 plus one additional) and then perform a single loop that associates the denominator objects with additional lepton. As before, we weight the demoninators with $\epsilon_{FR}(p_{T},\eta)$, apply kinematic selection and sum. The additional lepton with which the denominators are paired is either a fake (Z+jets) or real (WZ+jets) lepton. In the case where the additional lepton is fake over-represented because each jet leg has a probability of $\epsilon_{FR}$ to passing the tight selection.to jets The sum includes both contributions, counting picks a This sum double counts the contributions from are summed in our signal region to
129
130			%\begin{eqnarray}
131			% \ell~\Sigma_{i=0}^{Nd}~\epsilon(\eta^{i},p_{T}^{i}) &=& 2\times N(Z+j) + N(WZ)
132			%\end{eqnarray}
133
134	khahn	1.2	It is difficult to obtain a reliable estimate of the $Z + jets$ $m(4\ell)$ distribution with the limited number of events containing a good Z1 and two denominator objects. We loosen the denominator and Z2 selections to better understand the shape. For the muon-channel, we relax the isolation requirement in the denominator definition and remove the opposite-sign requirement in Z2 selection. For electrons we {\bf XXX}. With these modifications we obtain the $m(4\ell)$ distributions shown Figure~\ref{fig:fakeshapes}. We fit the shapes with Landau distributions and obtain acceptable values for goodness-of-fit. Consequently, we also choose Landau distributions for modeling the $m(4l)$ distribution of our $Z+jets$ predictions, as is shown in Figure~\ref{fig:fakeshapes}. We take differences with respect to the high-statistics distributions as a systematic uncertainties on our predicted shapes.
135	khahn	1.1
136			%-------------------------------------------------
137			\begin{figure}[tbp]
138			\begin{center}
139			\includegraphics[width=0.45\linewidth]{figs/muFakeShape-4m.png}
140			\includegraphics[width=0.45\linewidth]{figs/muFakeShape-2m.png}
141			\includegraphics[width=0.45\linewidth]{figs/muFakeShape-4m.png}
142			\caption{$\gamma_d$ Branching Ratios.\label{fig:fakeshapes} }
143			\end{center}
144			\end{figure}
145			%-------------------------------------------------
146
147			\subsection{Cross Check : Light Flavor }
148
149	khahn	1.2	We cross-check our fakerates and procedures by attempting to predict the number of fakes contributing in an independent control region. We require one lepton candidate that passes our nominal lepton selection and $1+$ opposite-sign, same-flavor denominator objects. In the muon-channel, this selection provides a sample of nearly pure background, of which the primary contribution is $W+jet$ where the jet fakes a lepton. For electrons, charge misidentification is significant enough to result in a noticeable Z-peak. However, the contribution of fakes in the electron sample is easily estimated from a fit with a same-sign MC Z template and an exponential background PDF. Events selected in data are shown in Figures~\ref{fig:ssMuon} and (\ref{fig:ssEle}) as points. Table~\ref{tab:ssfakes} lists the total number of observed events in the muon-channel and the electron-channel background estimated from the fit.
150	khahn	1.1
151			%-------------------------------------------------
152			\begin{figure}[tbp]
153			\begin{center}
154			\includegraphics[width=0.45\linewidth]{figs/ssMuon-m4l.png}
155			\includegraphics[width=0.45\linewidth]{figs/ssMuon-pt.png}
156			\caption{{\bf Fakerate validation with same-sign muon events.}\small{blah.}\label{fig:ssMuon} }
157			\end{center}
158			\end{figure}
159			%-------------------------------------------------
160
161			%-------------------------------------------------
162			\begin{figure}[tbp]
163			\begin{center}
164			\includegraphics[width=0.45\linewidth]{figs/ssMuon-m4l.png}
165			\includegraphics[width=0.45\linewidth]{figs/ssMuon-pt.png}
166			\caption{{\bf Fakerate validation with same-sign electron events.}\small{blah.}\label{fig:ssEle} }
167			\end{center}
168			\end{figure}
169			%-------------------------------------------------
170
171	khahn	1.2	Next, we predict the number of same-sign fake events in these samples by applying our fakerates to the denominator objects. We loop over all denominator objects in these events that fail full lepton selection, weight each with the appropriate factor of $\epsilon_{FR}(p_{T},\eta)/(1-\epsilon_{FR}(p_{T},\eta))$ and sum. The expected $m(\ell\ell)$ and $p_{T}$ distributions we obtain from this procedure are shown as solid histograms in Figures~\ref{fig:ssMuon} and ~\ref{fig:ssEle}. Predicted yields are compared with observation in Table~\ref{tab:ssfakes}. The shape of the observed $m(\ell\ell)$ distribution is well described by our prediction. The normalizations agree to within $6\%$ (muons) and $X\%$ (electrons). {\bf Shouldn't take every cross check as a systematic ... but should we take this one? Assign it to sample depend somehow?}.
172	khahn	1.1
173			%-------------------------------------------------
174			\begin{table}[tbh]
175			\begin{center}
176			\begin{tabular}{\|c\|\|c\|c\|}
177			\hline
178			channel & observed & predicted \\
179			\hline
180			\hline
181			$same sign~\mu\mu$ & $1741$ & $1642.6 \pm X$ \\
182			$same sign~ee$ & $X$ & $Z \pm Y$ \\
183			\hline
184			\end{tabular}
185			\caption{{\bf Same-sign Control Yields.}\label{tab:ssfakes}}
186			\end{center}
187			\end{table}
188			%-------------------------------------------------
189
190			\subsection{Cross Check : Heavy Flavor }
191
192			\subsection{Cross Check : W/Z }
193
194			%_________________________________________________________________
195			\subsection{$t\bar{t}$, $Zb\bar{b}$, $Zc\bar{c}$}\label{sec:HF}
196			%_________________________________________________________________
197	khahn	1.2	Background from $t\bar{t}$ and $Zb\bar{b}/c\bar{c}$ involves real leptons from heavy flavor decays. While lepton multiplicity, decay length and kinematics are correctly simulated ({\bf reference?}), we can not rely on MC to accurately model the identification and isolation efficiency of leptons embedded in heavy flavor jets. We address this by determining a set of data-driven efficiency scale factors, $f_{HF}$, that we use these to correct MC predictions for $t\bar{t}$ and $Zb\bar{b}$, $Zc\bar{c}$ 4-lepton backgrounds.
198	khahn	1.1
199			We calculate the efficiency scale factors using simulated $Zb\bar{b}$ and $t\bar{t}$ samples and a large-IP control region in data. The control region consists of events that contain a pair of preselected leptons passing our Z1 selection and at least two additional reconstructed ({\it i.e.:} denominator) lepton candidates with $\sigma_{IP} > 5$. We do not require the denominators to be opposite-charge, same-flavor or isolated.
200
201	khahn	1.2	The left plot of Figure~\ref{fig:ZHF} compares the predicted and observed $m(Z1)$ distributions from the control regions in data and MC for events passing the denominator selection described above. Both shape and normalization agree; we observe $98$ events and predict $91.1 \pm XXX$. We then require the high-IP lepton candidates to additionally pass the more stringent lepton ID and isolation criteria used in our nominal Z2 selection. We predict $0.80 \pm XXX$ events and observe 2. Table~\ref{tab:HFSF} lists the predicted and observed number of individual Z2 muon/electron candidates that pass the denominator and Z2 lepton selections\footnote{Note that values in the ``pass'' column are calculated for {\it single} leptons, not pairs.} We calculate per-lepton efficiencies from the fraction of denominators that pass the Z2 requirements. Per-lepton efficiency scale factors $f_{HF}$ are determined from the ratio of data and MC efficiencies.
202	khahn	1.1
203			%-------------------------------------------------
204			\begin{figure}[tbp]
205			\begin{center}
206			\includegraphics[width=0.45\linewidth]{figs/HF1.png}
207			\includegraphics[width=0.45\linewidth]{figs/HF1.png}
208			\caption{$\gamma_d$ Branching Ratios.\label{fig:ZHF} }
209			\end{center}
210			\end{figure}
211			%-------------------------------------------------
212
213			%-------------------------------------------------
214			\begin{table}[tbh]
215			\begin{center}
216			\begin{tabular}{\|c\|ccc\|ccc\|c\|}
217			\hline
218			~ & \multicolumn{3}{\|c\|}{Monte Carlo} & \multicolumn{3}{\|c\|}{Data} & ~ \\
219			\hline
220			type & denom & pass & $\epsilon~(\%)$ & denom & pass & $\epsilon~(\%)$ & $f_{HF}$ \\
221			\hline
222			$e$ & $34.0$ & $1.9$ & $5.6 \pm 3.9$ & $43$ & $4$ & $9.3 \pm 4.4$ & $1.67 \pm X$ \\
223			$\mu$ & $148.3$ & $8.0$ & $5.4 \pm 1.9$ & $153$ & $12$ & $7.8 \pm 2.2$ & $1.46 \rm X$ \\
224			\hline
225			\end{tabular}
226			\caption{Muon Isolation.\label{tab:HFSF}}
227			\end{center}
228			\end{table}
229			%-------------------------------------------------
230
231			To a large extent lepton isolation and identification are uncorrelated with impact parameter. Therefore, we can use $f_{HF}$ to correct MC predictions for the number of heavy flavor events with low-IP leptons that pass our full event selection. First, we select events in MC with a good Z1 and two $\sigma(IP) < 4$ lepton candidates satisfying the denominator requirements given above. Next, the candidates are required to be opposite-charge and same flavor with $m(Z2) > 12\rm~GeV$, consistent with a good Z2. Estimates of heavy flavor background can be simply obtained by applying full lepton identification and isolation requirements to the denominator leptons. To reduce statistical uncertainties, we categorize the predictions according to Z2 lepton flavor and ignore the flavor of Z1. These results are given in the first column of Table~\ref{tab:HFyield}.
232
233			%-------------------------------------------------
234			\begin{table}[tbh]
235			\begin{center}
236			\begin{tabular}{\|c\|c\|c\|}
237			\hline
238			channel & uncorrected & corrected \\
239			\hline
240			$ee$ & $0.130 \pm 0.X$ & $0.364 \pm 0.X$ \\
241			$\mu\mu$ & $0.095 \pm 0.X$ & $0.203 \pm 0.X$ \\
242			\hline
243			\end{tabular}
244			\caption{(Un)Corrected Z2 Yields.\label{tab:HFyield}}
245			\end{center}
246			\end{table}
247			%-------------------------------------------------
248
249	khahn	1.2	Assuming an equal number of electron and muon-channel Z1 decays and accounting for data/MC efficiency differences appropriate for each Z1 flavor ({\bf do this!!!}), we obtain the uncorrected predictions for the individual $4e$, $\mu4$ and $2e2\mu$ channels listed in Table~\ref{tab:HFyield2}. To obtain corrected predictions, we weight the total Z2 yields with two powers of the relevant $f_{HF}$ and repeat the procedure. Corrected $4e$, $\mu4$ and $2e2\mu$ yields are listed in the second column of Table~\ref{tab:HFyield2}.
250	khahn	1.1
251			%-------------------------------------------------
252			\begin{table}[tbh]
253			\begin{center}
254			\begin{tabular}{\|c\|c\|c\|}
255			\hline
256			channel & uncorrected & corrected \\
257			\hline
258			$4e$ & $0.065 \pm 0.X$ & $0.182 \pm 0.X$ \\
259			$4\mu$ & $0.047 \pm 0.X$ & $0.102 \pm 0.X$ \\
260			$2e2\mu$ & $0.113 \pm 0.X$ & $0.295 \pm 0.X$ \\
261			\hline
262			total & $0.225 \pm 0.X$ & $0.579 \pm 0.X$ \\
263			\hline
264			\end{tabular}
265			\caption{(Un)Corrected Heavy Flavor Yields.\label{tab:HFyield2}}
266			\end{center}
267			\end{table}
268			%-------------------------------------------------
269
270	khahn	1.2	We determine a shape for heavy flavor background in the signal region from the distribution of $m(4\ell)$ in the high-IP control region. Neither data nor MC contain a sufficient number events with Z2 leptons that pass ID and isolation for us to directly predict the shape, so we instead take the Z1 + $2\times$ denominator shape from data as a template. Figure~\ref{fig:HFshape} compares the $m(4\ell)$ distribution for this selection in data and MC. The figure MC distributions are cross-section normalized and correspond to the $91.1$ events mentioned above. The leftmost plot shows that the predicted $m(4\ell)$ shape is not a good representation of the shape in data. The observed difference is consistent with a simulated energy scale that is too small for electrons below $20~\rm GeV$ (see Figure~\ref{fig:HFshapeScaled}). We choose the shape from data as our signal region template and assign a systematic shape uncertainties based on the observed deviation with the MC shape.
271	khahn	1.1
272			%-------------------------------------------------
273			\begin{figure}[tbp]
274			\begin{center}
275			\includegraphics[width=0.5\linewidth]{figs/m4l-HF.png}
276			\caption{Heavy Flavor $m(4\ell)$ Shape.\label{fig:HFshape} }
277			\end{center}
278			\end{figure}
279			%-------------------------------------------------
280
281			%-------------------------------------------------
282			\begin{figure}[tbp]
283			\begin{center}
284			\includegraphics[width=0.5\linewidth]{figs/m4l-HF.png}
285			\caption{Heavy Flavor $m(4\ell)$ Shape.\label{fig:HFshapeScaled} }
286			\end{center}
287			\end{figure}
288			%-------------------------------------------------