MitHzz4l/Documentation/Backgrounds.tex

%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
\section{Backgrounds}\label{sec:BG}
%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
This section reviews our evaluation of background in the $4\ell$ analysis.  We discuss expected yields and the predicted $m(4\ell)$ shapes, both of which are used in the limit and sensitivity calculations described in Section~\ref{sec:Extraction}.  We estimate Electroweak (EWK) backgrounds with Monte Carlo.  Our estimates of instrumental and jet backgrounds are data-driven.

%_________________________________________________________________
\subsection{Electroweak Backgrounds}\label{sec:EWK}
%_________________________________________________________________
We use the $ZZ$, $WZ$ and $Z\gamma$ MC samples listed in Table~\ref{tab:MC} to estimate yields and $m(4\ell)$ shapes for these backgrounds.  We correct the acceptances determined from simulation using the procedures described in Section~\ref{sec:Signal}.  We determine background yields using the corrected $4e$, $4\mu$ and $2e2\mu$ acceptances ($\alpha^{c}$) for each process:

\begin{eqnarray}
N^{exp}_{i}  & = & \alpha^{c}_{i}\,\sigma_{i}\int\mathcal{L}\,dt
\end{eqnarray}

The cross sections used in the formula above are taken from Table~\ref{tab:MC}.  Table~\ref{tab:MCBG} lists the $\alpha^{c}$ and the expected yields in $2.1\rm~fb^{-1}$ for the diboson backgrounds.  Figure~\ref{fig:MCshapes} shows a yield-normalized stack of the corresponding $m(4\ell)$ distributions.

%-------------------------------------------------
\begin{figure}[htb]
\begin{center}
\includegraphics[width=0.5\linewidth]{figs/HF1.png}
\caption{MC Background Shapes.{\bf put the right plot here} }
\label{fig:MCshapes}
\end{center}
\end{figure}
%-------------------------------------------------

%-------------------------------------------------
\begin{table}[htb]
\begin{center}
\begin{tabular}{c|cc|cc|cc}
{\bf Process}   & $\alpha^{c}_{4e}$  & $N^{exp}_{4e}$ & $\alpha^{c}_{4\mu}$  & $N^{exp}_{4\mu}$ & $\alpha^{c}_{2e2\mu}$  & $N^{exp}_{2e2\mu}$ \\
\hline
$ZZ^{*}$     &  ~                 & ~              &  ~                 & ~              &  ~                 & ~              \\ 
$WZ$      &  ~                 & ~              &  ~                 & ~              &  ~                 & ~              \\ 
$Z\gamma$ &  ~                 & ~              &  ~                 & ~              &  ~                 & ~              \\ 
\hline
\end{tabular}
\caption{MC Background Yields.}
\label{tab:MCBG}
\end{center}
\end{table}
%-------------------------------------------------

We consider two sources of systematic uncertainty on the EWK background predictions.  The first is the uncertainty on the efficiency scale factors, which we propagate from the tables of Section~\ref{sec:Leptons} to the corrected acceptance for each channel {\bf still have to do this}.  The second concerns the influence of missing higher-order corrections on the mass shapes and kinematics predicted by the MC.  We estimate the magnitude of this effect by reweighting the POWHEG samples at generator level to the $m(4\ell)$ distributions predicted by MCFM with the renormalization and factorization scales varied up and down by a factor of two.  We take the relative differences in shape as an uncertainty in the limit calculation.  Figure~\ref{fig:EWKshapeSys} shows the relative shape differences we obtain after reweighting {\bf done for ZZ, needed for the others}.
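
As a sketch of the reweighting, and assuming that it is applied bin-by-bin in the generator-level mass with both predictions normalized to unit area (neither detail is specified above; $w_{\mu}$ and $\mu_{0}$ are notation introduced here for illustration), each POWHEG event would receive the weight

\begin{eqnarray}
w_{\mu}(m_{4\ell}) & = & \left.\frac{1}{\sigma}\frac{d\sigma}{dm_{4\ell}}\right|_{{\rm MCFM},\,\mu} \bigg/ \left.\frac{1}{\sigma}\frac{d\sigma}{dm_{4\ell}}\right|_{\rm POWHEG}, \qquad \mu\in\{2\mu_{0},\,\mu_{0}/2\} \nonumber
\end{eqnarray}

where $\mu_{0}$ denotes the nominal scale choice.  Under these assumptions the shape uncertainty in each bin of reconstructed $m(4\ell)$ is the relative change of the reweighted distribution with respect to the nominal one.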

%-------------------------------------------------
\begin{figure}[bht]
\begin{center}
\includegraphics[width=0.5\linewidth]{figs/HF1.png}
\caption{EWK Shape Differences From MCFM Reweight.{\bf get the right plot in here} }
\label{fig:EWKshapeSys}
\end{center}
\end{figure}
%-------------------------------------------------

%_________________________________________________________________
\subsection{Instrumental/Fake Backgrounds}\label{sec:fakes}
%_________________________________________________________________
$Z+jets$, $Zb\bar{b}/c\bar{c}$ and $t\bar{t}$ backgrounds (collectively, $\ell\ell jj$) contribute to the $4\ell$ signal region when jets in these events are either mismeasured as leptons or produce real leptons through secondary interactions.  These processes are difficult to simulate accurately, so we estimate their contribution from data.  We assess $\ell\ell jj$ backgrounds using the ``fakeable object'' technique~\cite{fakeable}.  For this method we define ``fakerates'' with respect to loosely identified lepton candidates, referred to as ``denominator objects''.  The electron and muon denominator selections are defined in Table~\ref{tab:fo}.

%-------------------------------------------------
\begin{table}[htb]
\begin{center}
\begin{tabular}{c|c|c|c}
\multicolumn{2}{c|}{Electron}                    & \multicolumn{2}{c}{Muon}  \\
\hline
variable               & requirement              & variable   & requirement          \\
\hline
~ &         ~  &                                  $p_{T}$                & $> 5\rm~GeV$            \\
~ &         ~  &                                  type                   & $\rm Global~||~Tracker$  \\
~ &         ~  &                                  $|d_{0}|$              & $< 2\rm~mm$             \\
~ &         ~  &                                  $Iso^{pf}_{0.3}$       & $< 3\times p_{T}$        \\
\end{tabular}
\caption{Denominator Object Definitions}\label{tab:fo}
\end{center}
\end{table}
%-------------------------------------------------

We calculate the fakerates ($\epsilon_{FR}(p_{T},\eta)$) from samples of events that pass single-lepton triggers: \verb|HLT_Ele8| for electrons, and \verb|HLT_Mu8| or \verb|HLT_Mu13| for muons.  In both channels we reduce contamination from $W\rightarrow \ell\nu$ and $Z/\gamma^{*}\rightarrow\ell\ell$ by vetoing events with $MET > 20\rm~GeV$, with $m_{T} > 35\rm~GeV$, or with two or more denominator objects of $p_{T} > 10\rm~GeV$.  We enrich the samples in background by selecting only those denominator objects that are well separated ($\Delta R(\eta,\phi) > 1.0$) from a reconstructed jet with $p_{T} > 35\rm~GeV$.  Figure~\ref{fig:FR} shows the electron and muon fakerates obtained from this procedure as a function of $p_{T}$.
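
Written out, and assuming the usual fakeable-object convention in which the numerator is the subset of denominator objects that also pass the full lepton selection (the counts $N_{\rm denom}$ and $N_{\rm pass}$ are notation introduced here for illustration), the fakerate in a given $(p_{T},\eta)$ bin is

\begin{eqnarray}
\epsilon_{FR}(p_{T},\eta) & = & \frac{N_{\rm pass}(p_{T},\eta)}{N_{\rm denom}(p_{T},\eta)} \nonumber
\end{eqnarray}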

%-------------------------------------------------
\begin{figure}[tbp]
\begin{center}
\includegraphics[width=0.45\linewidth]{figs/frMu.png}
\includegraphics[width=0.45\linewidth]{figs/frMu.png}
\caption{ Muon and Electron Fake Rates.}
\label{fig:FR} 
\end{center}
\end{figure}
%-------------------------------------------------

We estimate $\ell\ell jj$ backgrounds in the signal region by applying the fakerates in events that contain a good Z1.  First, we select denominator objects that fail identification/isolation to prevent bias from real leptons.  Next, we loop over pairs of the denominator objects, weight each leg with $\epsilon_{FR}(p_{T},\eta)/(1-\epsilon_{FR}(p_{T},\eta))$ and apply the Z2 kinematic requirements ($12 < m(Z2) < 120\rm~GeV$).  The denominator in the weight term accounts for the fact that we only consider candidates that fail the full lepton selection.  Weighted pairs that pass the Z2 kinematic selection are summed to obtain an estimate of the $\ell\ell jj$ background.
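
The form of the weight can be motivated with a short counting argument (a sketch; $N_{\rm pass}$, $N_{\rm fail}$ and $w_{i}$ are notation introduced here for illustration, and we assume the fakerate is defined as the fraction of denominator objects that pass the full lepton selection).  If a fraction $\epsilon_{FR}$ of the jet-induced denominator objects in a given $(p_{T},\eta)$ bin passes the full selection, the passing and failing counts are related by

\begin{eqnarray}
N_{\rm pass} & = & \frac{\epsilon_{FR}}{1-\epsilon_{FR}}~N_{\rm fail} \nonumber
\end{eqnarray}

so that each failing object stands in for $\epsilon_{FR}/(1-\epsilon_{FR})$ passing ones.  Under this convention the pair estimate reads

\begin{eqnarray}
N(\ell\ell jj) & = & \sum_{Z1~{\rm events}}~\sum_{i<j}~w_{i}\,w_{j}, \qquad w_{i} = \frac{\epsilon_{FR}(p_{T}^{i},\eta^{i})}{1-\epsilon_{FR}(p_{T}^{i},\eta^{i})} \nonumber
\end{eqnarray}

where the inner sum runs over pairs of failing denominator objects that satisfy the Z2 kinematic requirements.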

Table~\ref{tab:fakes} presents $\ell\ell jj$ background estimates for the $2.1\rm~fb^{-1}$ dataset.  We maximize the statistical power of the small $Z1 + \ge 2\rm~denominator$ sample by integrating over the flavor of the $Z1$ leptons and then dividing the $Z1$-inclusive prediction between the $4\ell_{e,\mu}$ and $2\ell_{e,\mu}2\ell_{\mu,e}$ channels.  We perform the division by assuming equal $ee$ and $\mu\mu$ $Z1$ branching ratios and using an acceptance factor ($\sim1$, measured from inclusive $Z\rightarrow ee,\mu\mu$ yields {\bf need to double check this. 1 seems strange}) to account for efficiency differences in the detection of electrons and muons.

%-------------------------------------------------
\begin{table}[htb]
\begin{center}
\begin{tabular}{c|c}
\hline
\multicolumn{2}{c}{Z1-Inclusive $\ell\ell jj$ Yields} \\
\hline
$Z1 + \mu\mu$      & $0.057 \pm X$ \\
$Z1 + ee$          & $X \pm Y$     \\
\hline
\multicolumn{2}{c}{Per-Channel $\ell\ell jj$ Yields} \\
\hline 
$4\mu$      & $0.044 \pm X$ \\
$4e$        & $X \pm Y$     \\
$2e2\mu$    & $(0.013 + Z) \pm Y$     \\
\hline
\end{tabular}
\caption{Expected $\ell\ell jj$ Events.}
\label{tab:fakes}
\end{center}
\end{table}
%-------------------------------------------------

It is difficult to predict $m(4\ell)$ and kinematic shapes for $\ell\ell jj$ background with the limited number of events containing a good $Z1$ and two failing denominator objects.  Although loosening the denominator and $Z1,2$ selections helps, these requirements must not be made so loose that distributions from the control region no longer resemble those of the signal region.  As an alternative, we study shapes using $Z+jets$, $t\bar{t}$ and $Zb\bar{b}$ MC events that pass our nominal selections.  Figure~\ref{fig:fakeshapes}, for example, shows the cross section normalized $m(4\ell)$ distributions from these processes.

%-------------------------------------------------
\begin{figure}[tbp]
\begin{center}
\includegraphics[width=0.45\linewidth]{figs/muFakeShape-4m.png}
\caption{ Predicted $m(4\ell)$ Distributions for $\ell\ell jj$ Events. {\bf this is an old data-driven plot.  Put the MC one here.}}
\label{fig:fakeshapes}
\end{center}
\end{figure}
%-------------------------------------------------

%_________________________________________________________________
\subsubsection{Cross Check and Systematics: Light Flavor }\label{sec:lflavor}
%_________________________________________________________________
We cross-check our procedures by predicting the number of fake leptons in independent control regions enriched in light flavor.  We require one lepton candidate with $p_{T} > 25\rm~GeV$ that passes our nominal lepton selection and at least one same-sign, same-flavor denominator object.  We veto events with $76 < m(\ell\ell) < 106\rm~GeV$ to reduce real lepton contamination from Z decays.

In the muon channel this selection produces a sample of pure background, of which the primary component is $W+jets$ with a jet faking a muon.  The smaller multi-jet backgrounds, consisting of both light and heavy flavor, contain at least two jets that both fake muons.  We reduce the heavy flavor contribution in this sample by requiring $|\sigma(IP_{3D})/IP_{3D}| < 3$ for all muon candidates and $MET > 25\rm~GeV$.  We determine the relative abundances for events in which the denominator muon passes selection by fitting the resulting MET distribution with a same-sign MC template for $W+jets$ and a Rayleigh distribution for multi-jets.  The fit result (Figure~\ref{fig:ssMuon}, left) indicates that $W+jets$ constitutes $\sim80\%$ of the sample.  Residual contributions from heavy flavor in the same-sign muon sample are therefore small.

%-------------------------------------------------
\begin{figure}[htb]
\begin{center}
\includegraphics[width=0.45\linewidth]{figs/ssMuMET.png}
\includegraphics[width=0.45\linewidth]{figs/ssMuMZ1.png}
\caption{Fakerate Predictions for Same-sign Muon Events.}\label{fig:ssMuon} 
\end{center}
\end{figure}
%-------------------------------------------------

Next, we predict the number of events containing two identified and isolated same-sign muons by applying our fakerates to denominator objects that fail selection.  We loop over all such objects, weight each with the appropriate factor of $\epsilon_{FR}(p_{T},\eta)/(1-\epsilon_{FR}(p_{T},\eta))$ and sum.  The expected and observed $m(\ell\ell)$ distributions are shown in the rightmost plot of Figure~\ref{fig:ssMuon}.  The shape of the predicted distribution agrees with the observation; however, the observed yield exceeds the prediction by $47.2\%$.
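
For reference, the single-fake prediction and the size of the discrepancy can be written out explicitly (a sketch; $N^{pred}_{SS}$ and $N^{obs}_{SS}$ are notation introduced here for illustration).  The prediction sums one weight per failing denominator object,

\begin{eqnarray}
N^{pred}_{SS} & = & \sum_{i}~\frac{\epsilon_{FR}(p_{T}^{i},\eta^{i})}{1-\epsilon_{FR}(p_{T}^{i},\eta^{i})} \nonumber
\end{eqnarray}

and, with the muon-channel yields of Table~\ref{tab:ssfakes}, the quoted difference is measured relative to the prediction:

\begin{eqnarray}
\frac{N^{obs}_{SS} - N^{pred}_{SS}}{N^{pred}_{SS}} & = & \frac{159 - 108.04}{108.04} ~\approx~ 0.472 \nonumber
\end{eqnarray}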

%This difference can be understood as a result of differences in the composition of the prediction sample (mainly light flavor) and that used to measure the fakerate (a mix of light and heavy flavor).  

{\bf For electrons ...}
%For electrons, charge misidentification is significant enough to result in a noticible Z-peak.  The jet background is however easily estimated from a fit with a same-sign MC Z template and an exponential background PDF.  Events selected in data are shown in Figures~\ref{fig:ssMuon} and (\ref{fig:ssEle}) as points.  Table~\ref{tab:ssfakes} lists the total number of observed events in the muon-channel and the electron-channel background determined from the fit.

%-------------------------------------------------
\begin{figure}[htb]
\begin{center}
\includegraphics[width=0.45\linewidth]{figs/ssMuMET}
\includegraphics[width=0.45\linewidth]{figs/ssMuMZ1}
\caption{ Fakerate Predictions for Same-sign Electron Events. {\bf Plots are currently for muons ... }}
\label{fig:ssEle} 
\end{center}
\end{figure}
%-------------------------------------------------

Table~\ref{tab:ssfakes} summarizes the results of this section.  We take $47.2\%$ ($X\%$) as the systematic uncertainty on the muon (electron) fakerate to account for potential biases in our prediction due to differences in light flavor composition.

%-------------------------------------------------
\begin{table}[tbh]
\begin{center}
\begin{tabular}{c|c|c|c}
\hline
channel                 & observed & predicted  & systematic     \\
\hline
${\rm same~sign} \mu\mu$      & $159$    & $108.04$  & $47.2\%$\\
${\rm same~sign} ee$          & $X$      & $Y$       &  $Z\%$ \\
\hline
\end{tabular}
\caption{Same-sign Control Yields and Systematic}
\label{tab:ssfakes}
\end{center}
\end{table}
%-------------------------------------------------

%_________________________________________________________________
\subsubsection{Cross Check and Systematics: Heavy Flavor }\label{sec:hflavor}
%_________________________________________________________________
Backgrounds from $t\bar{t}$ and $Zb\bar{b}/c\bar{c}$ involve real leptons from heavy flavor decays.  As with light flavor, a difference in the fraction of heavy flavor in the fakerate and prediction samples can lead to errors in signal region background estimation.  We assess the impact of heavy flavor composition differences by applying our fakerate in a sample of relatively pure $Zb\bar{b}/c\bar{c}$ and $t\bar{t}$.

The control region consists of events that contain a pair of leptons passing the $Z1$ selection and at least two additional denominator objects with $\sigma(IP_{3D})/IP_{3D} > 4$.  Denominators are defined according to the requirements of Table~\ref{tab:fo}.  We make no requirement on denominator charge or flavor.  The leftmost plot of Figure~\ref{fig:ZHF} compares the observed $m(Z1)$ distributions for events passing this selection in data with cross section normalized predictions from MC.  We observe $71$ events and predict $66.3 \pm 2.0$ with $Zb\bar{b}$ and $t\bar{t}$ MC, which confirms that the data sample is indeed dominated by heavy flavor. {\bf update numbers, they're like 62 and 58 now ...}

Next, we require the high-IP denominator objects to additionally pass the more stringent lepton ID and isolation criteria used in our nominal Z2 selection.  We estimate $0.81 \pm 0.21$ events from MC and observe 2.  Electron and muon fakerates are then applied to the denominator objects in the original $71$ events and, following the procedures described in Section~\ref{sec:lflavor}, we predict $0.84 \pm 0.10$ events.  Given the consistent results, we assign no additional systematic uncertainty on our predicted  $\ell\ell jj$ background yields.

%We then reinstate the $\sigma_{IP_{3D}}/IP_{3D} < 4$ cut and estimate $2.5 \pm 0.4$ events in the signal region from the $Zb\bar{b}$ and $t\bar{t}$ MC.  We take this prediction as an estimate of the heavy flavor contribution to our overall $\ell\ell jj$ background esimtate of $XXX$.  We assign a s sysmatic uncertainty on the estimated fraction Considering the We assignconsider th

%-------------------------------------------------
\begin{figure}[htbp]
\begin{center}
\includegraphics[width=0.45\linewidth]{figs/HFmZ1.png}
\includegraphics[width=0.45\linewidth]{figs/HFm4l.png}
\caption{$m(Z1)$ and $m(4\ell)$ in the Heavy Flavor control region.}\label{fig:ZHF} 
\end{center}
\end{figure}
%-------------------------------------------------

%We determine a shape for heavy flavor background in the signal region from the distribution of $m(4\ell)$ from the $Z1 + 2\times$ denominator events.  The rightmost plot of Figure~\ref{fig:ZHF} compares the $m(4\ell)$ distributions for this selection in data and (cross-section normalized) simulation.  We fit both distributions with Landaus and compare the normalized PDFs in Figure~\ref{fig:HFshape}.  

%-------------------------------------------------
%\begin{figure}[htbp]
%\begin{center}
%\includegraphics[width=0.5\linewidth]{figs/HFshape.png}
%\caption{$m(4\ell)$ shapes in the Heavy Flavor control region.}\label{fig:HFshape} 
%\end{center}
%\end{figure}
%-------------------------------------------------
%_________________________________________________________________
\subsection{Cross Check and Systematics: $WZ$  }
%_________________________________________________________________
The estimate of $WZ$ background in Table~\ref{tab:MCBG} is entirely MC-based.  In addition to the leptons from $W$ and $Z$ decay, an additional ``fake'' lepton is needed for this process to contribute in the $4\ell$ signal region.  We cross-check MC predictions with an estimate obtained from the fakeable object method.  

We begin by requiring three fully selected leptons (two from the Z1 plus one additional) and $1+$ denominator objects.  We then perform a single loop to associate the denominator objects with the third lepton.  As before, we weight the denominators with $\epsilon_{FR}(p_{T},\eta)/(1-\epsilon_{FR}(p_{T},\eta))$, apply opposite-sign, same-flavor and kinematic selections and sum.  The additional, identified lepton with which the denominators are paired is either a fake (from $Z+jets$) or a real lepton (from $WZ$ or $ZZ$ where one of the leptons is not reconstructed).  In order to extract the $WZ$ component of the measurement, we need to subtract off the $3\ell$ contribution predicted by MC for $ZZ$ as well as the double-fake estimate described in Section~\ref{sec:fakes}.  The latter is double-counted when performing a single denominator loop. 

\begin{eqnarray}
 N(WZ) &=& \sum_{\ell\ell\ell~{\rm events}}~\sum_{i=1}^{N_{d}}~\frac{\epsilon_{FR}(p_{T}^{i},\eta^{i})}{1-\epsilon_{FR}(p_{T}^{i},\eta^{i})}  \nonumber \\
     ~ &-& 2\times \sum_{\ell\ell~{\rm events}}~\sum_{i=1}^{N_{d}}\sum_{j=i+1}^{N_{d}}~\frac{\epsilon_{FR}(p_{T}^{i},\eta^{i})}{1-\epsilon_{FR}(p_{T}^{i},\eta^{i})}~\frac{\epsilon_{FR}(p_{T}^{j},\eta^{j})}{1-\epsilon_{FR}(p_{T}^{j},\eta^{j})} \nonumber \\
     ~ &-& N^{MC}_{3\ell}(ZZ)
\end{eqnarray}

Table~\ref{tab:WZfake} lists values for the terms in the equation above.  The result ... {\bf fill in the table and quote a systematic for WZ}. 

%-------------------------------------------------
\begin{table}[tbh]
\begin{center}
\begin{tabular}{|c|c|c|}
\hline
$4e$    &         $4\mu$  &       $2e2\mu$ \\
\hline
$X\pm Y$ &         $Z\pm Y$ &       $Z\pm Y$  \\
\hline
\end{tabular}
\caption{Data-driven Expected $WZ$ Yields}
\label{tab:WZfake}
\end{center}
\end{table}
%-------------------------------------------------


Revision:	1.3
Committed:	Tue Nov 8 11:18:58 2011 UTC (13 years, 6 months ago) by khahn
Content type:	application/x-tex
Branch:	MAIN
Changes since 1.2:	+64 -94 lines
Log Message:	stuff corresponding to rough draft
#	User	Rev	Content
1	khahn	1.1	%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
2	khahn	1.3	\section{Backgrounds}\label{sec:BG}
3	khahn	1.1	%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
4			This section reviews our evaluation of background in the $4\ell$ analysis. We discuss expected yields and the predicted $m(4\ell)$ shapes, both of which are used in the limit and sensitivity calculations described in Section~\ref{sec:Extraction}. We estimate Electroweak (EWK) backgrounds with Monte Carlo. Our estimates of instrumental and jet backgrounds are data-driven.
5
6			%_________________________________________________________________
7			\subsection{Electroweak Backgrounds}\label{sec:EWK}
8			%_________________________________________________________________
9	khahn	1.3	We use the $ZZ$, $WZ$ and $Z\gamma$ MC samples listed in Table~\ref{tab:MC} to estimate yields and $m(4\ell)$ shapes for these backgrounds. We correct the acceptances determined from simulation using the procedures described in Section~\ref{sec:Signal}. We determine background yields using the corrected $4e$, $4\mu$ and $2e2\mu$ acceptances ($\alpha_{c}$) for each process:
10	khahn	1.1
11			\begin{eqnarray}
12	khahn	1.3	N^{exp}_{i} & = & \alpha^{c}_{i}\int\mathcal{L}\sigma_{i}
13	khahn	1.1	\end{eqnarray}
14
15	khahn	1.3	The cross sections used in formula above are taken from Table~\ref{tab:MC}. Table~\ref{tab:MCBG} lists the $\alpha_{c}$ and the expected $2.1\rm~fb^{-1}$ yields for the diboson backgrounds. Figure~\ref{fig:MCshapes} shows a yield-normalized stack of the corresponding $m(4\ell)$ distributions.
16
17			%-------------------------------------------------
18			\begin{figure}[htb]
19			\begin{center}
20			\includegraphics[width=0.5\linewidth]{figs/HF1.png}
21			\caption{MC Background Shapes.{\bf put the right plot here} }
22			\label{fig:MCshapes}
23			\end{center}
24			\end{figure}
25			%-------------------------------------------------
26	khahn	1.1
27			%-------------------------------------------------
28			\begin{table}[htb]
29			\begin{center}
30			\begin{tabular}{c\|cc\|cc\|cc}
31	khahn	1.3	{\bf Process} & $\alpha^{c}_{ee}$ & $N^{exp}_{ee}$ & $\alpha^{c}_{\mu\mu}$ & $N^{exp}_{\mu\mu}$ & $\alpha^{c}_{2e2\mu}$ & $N^{exp}_{2e2\mu}$ \\
32	khahn	1.1	\hline
33	khahn	1.3	$ZZ^{*}$ & ~ & ~ & ~ & ~ & ~ & ~ \\
34	khahn	1.1	$WZ$ & ~ & ~ & ~ & ~ & ~ & ~ \\
35			$Z\gamma$ & ~ & ~ & ~ & ~ & ~ & ~ \\
36			\hline
37			\end{tabular}
38	khahn	1.3	\caption{MC Background Yields.}
39			\label{tab:MCBG}
40	khahn	1.1	\end{center}
41			\end{table}
42			%-------------------------------------------------
43
44	khahn	1.3	We consider two sources of systematic uncertainties on the EWK background predictions. The first is due to the uncertainty on the efficiency scale-factors, which we propagate from the tables of Section~\ref{sec:Leptons} to the corrected acceptance for each channel. {\bf still have to do this}. The second uncertainty concerns the influence of missing higher-orders on the mass shapes and kinematics predicted by the MC. We estimate the magnitude of this effect by reweighting the POWHEG samples at generator-level to the $m(4\ell)$ distributions predicted by MCFM with renormalization and factorization scales varied by $\times 2$, $/2$. We take the relative differences in shape are used as an uncertainty in the limit calculation. Figure~\ref{fig:EWKshapeSys} shows the relative shape differences we obtain after reweighting. {\bf done for ZZ, needed for the others}.
45	khahn	1.1
46			%-------------------------------------------------
47			\begin{figure}[bht]
48			\begin{center}
49			\includegraphics[width=0.5\linewidth]{figs/HF1.png}
50	khahn	1.3	\caption{EWK Shape Differences From MCFM Reweight.{\bf get the right plot in here} }
51			\label{fig:EWKshapeSys}
52	khahn	1.1	\end{center}
53			\end{figure}
54			%-------------------------------------------------
55
56			%_________________________________________________________________
57			\subsection{Instrumental/Fake Backgrounds}\label{sec:fakes}
58			%_________________________________________________________________
59	khahn	1.3	$Z+jets$ , $Zb\bar{b}/c\bar{c}$ and $t\bar{t}$ backgrounds (collectively, $\ell\ell jj$) contribute to the $4\ell$ signal region when jets in these events are either mismeasured as leptons or produce real leptons through secondary interactions. These processes are difficult to accurately simulate so we estimate their contribution from data. We assess $\ell\ell jj$ backgrounds using the ``fakeable object'' technique~\cite{fakeable}. For this method we define ``fakerates'' with respect to loosely identified lepton candidates, referred to as ``denominator objects''. Electron and muon denominator selections are defined in Table~\ref{tab:fo}.
60	khahn	1.1
61			%-------------------------------------------------
62			\begin{table}[htb]
63			\begin{center}
64	khahn	1.3	\begin{tabular}{c\|c\|c\|c}
65			\multicolumn{2}{c\|}{Electron} & \multicolumn{2}{\|c}{Muon} \\
66	khahn	1.1	\hline
67			variable & requirement & variable & requirement \\
68			\hline
69	khahn	1.3	~ & ~ & $p_{T}$ & $> 5\rm~GeV$ \\
70			~ & ~ & type & $\rm Global~\|\|~Tracker$ \\
71			~ & ~ & $\|d_{0}\|$ & $< 2\rm~mm$ \\
72			~ & ~ & $Iso^{pf}_{0.3}$ & $< 3\times p_{T}$ \\
73	khahn	1.1	\end{tabular}
74			\caption{Denominator Object Definitions}\label{tab:fo}
75			\end{center}
76			\end{table}
77			%-------------------------------------------------
78
79			We calculate the fakerates ($\epsilon_{FR}(p_{T},\eta)$) from samples of events that pass single lepton triggers: \verb\|HLT_Ele8\| for electrons, \verb\|HLT_Mu8\| or \verb\|HLT_Mu13\| for muons. In both channels we reduce contamination from $W\rightarrow \ell\nu$ and $Z/\gamma^{*}\rightarrow\ell\ell$ by vetoing events with $MET > 20\rm~GeV$, or with $m_{T} > 35\rm~GeV$ or with two or more denominator objects of $p_{T} > 10\rm~GeV$. We enrich the samples in background by selecting only those denominator objects opposite to ($\Delta R(\eta,\phi) > 1.0$) a reconstructed $p_{T} > 35\rm~GeV$ jet. Figure~\ref{fig:FR} shows the electron and muon fakerates obtained from this procedure as a function of $p_{T}$.
80
81			%-------------------------------------------------
82			\begin{figure}[tbp]
83			\begin{center}
84			\includegraphics[width=0.45\linewidth]{figs/frMu.png}
85	khahn	1.3	\includegraphics[width=0.45\linewidth]{figs/frMu.png}
86			\caption{ Muon and Electron Fake Rates.}
87			\label{fig:FR}
88	khahn	1.1	\end{center}
89			\end{figure}
90			%-------------------------------------------------
91
92			We estimate $\ell\ell jj$ backgrounds in the signal region by applying the fakerates in events that contain a good Z1. First, we select denominator objects that fail identification/isolation to prevent bias from real leptons. Next, we loop over pairs of the denominator objects, weight each leg with $\epsilon_{FR}(p_{T},\eta)/(1-\epsilon_{FR}(p_{T},\eta))$ and apply the Z2 kinematic requirements ($12\rm~GeV < m(Z2) < 120$). The denominator in the weight term accounts for the fact that the we only consider candidates that fail full lepton selection. Weighted pairs that pass the Z2 kinematic selection are summed to obtain an estimate of the $\ell\ell jj$ background.
93
94	khahn	1.3	Table~\ref{tab:fakes} presents $\ell\ell jj$ background estimates for the $2.1\rm~fb^{-1}$ dataset. We maximize the statistical power of the small $Z1 + \ge 2\rm~denominator$ sample by integrating over the flavor of the $Z1$ leptons and then dividing the $Z1$-inclusive prediction between the $4\ell_{e,\mu}$ and $2\ell_{e,\mu}2\ell_{\mu,e}$ channels. The division is performed by assuming equal $ee$ and $\mu\mu$ $Z1$ branching ratios and using an acceptance factor ($=\sim1$, measured from inclusive $Z\rightarrow ee,\mu\mu$ yields {\bf need to double check this. 1 seems strange}) to account for efficiency differences in the detection of electrons and muons.
95	khahn	1.1
96			%-------------------------------------------------
97			\begin{table}[htb]
98			\begin{center}
99			\begin{tabular}{c\|c}
100			\hline
101			\multicolumn{2}{c}{Z1-Inclusive $\ell\ell jj$ Yields} \\
102			\hline
103			$Z1 + \mu\mu$ & $0.057 \pm X$ \\
104			$Z1 + ee$ & $X \pm Y$ \\
105			\hline
106			\multicolumn{2}{c}{Per-Channel $\ell\ell jj$ Yields} \\
107			\hline
108			$4\mu$ & $0.044 \pm X$ \\
109			$4e$ & $X \pm Y$ \\
110			$2e2\mu$ & $(0.013 + Z) \pm Y$ \\
111			\hline
112			\end{tabular}
113	khahn	1.3	\caption{Expected $\ell\ell jj$ Events.}
114			\label{tab:fakes}
115	khahn	1.1	\end{center}
116			\end{table}
117			%-------------------------------------------------
118
119	khahn	1.3	It is difficult to predict $m(4\ell)$ and kinematic shapes for $\ell\ell jj$ background with the limited number of events containing a good $Z1$ and two failing denominator objects. Although loosening the denominator and $Z1,2$ selections helps, these requirements must not be made so loose that distributions from the control region no longer resemble those of the signal region. As an alternative, we study shapes using $Z+jets$, $t\bar{t}$ and $Zb\bar{b}$ MC events that pass our nominal selections. Figure~\ref{fig:fakeshapes}, for example, shows the cross section normalized $m(4\ell)$ distributions from these processes.
120	khahn	1.1
121			%-------------------------------------------------
122			\begin{figure}[tbp]
123			\begin{center}
124			\includegraphics[width=0.45\linewidth]{figs/muFakeShape-4m.png}
125	khahn	1.3	\caption{ Predicted $m(4\ell)$ Distributions for $\ell\ell jj$ Events. {\bf this is an old data-driven plot. Put the MC one here.}}
126			\label{fig:fakeshapes}
127	khahn	1.1	\end{center}
128			\end{figure}
129			%-------------------------------------------------
130
131			%_________________________________________________________________
132			\subsubsection{Cross Check and Systematics: Light Flavor }\label{sec:lflavor}
133			%_________________________________________________________________
134			We cross-check our procedures by predicting the number of fake leptons in independent control regions enriched in light flavor. We require one $p_{T} > 25\rm~GeV$ lepton candidate that passes our nominal lepton selection and $1+$ same-sign, same-flavor denominator objects. We veto events with $m(\ell\ell)$ between $76-106\rm~GeV$ to reduce real lepton contamination from Z decays.
135
136			In the muon-channel this selection produces a sample of pure background, of which the primary component is $W+jet$ with a jet faking a muon. The smaller multi-jet backgrounds, consisting of both light and heavy flavor, contain at least two jets that both fake muons. We reduce the heavy flavor contribution in this sample by requiring $\|\sigma(IP_{3D})/IP_{3D} < 3\|$ for all muon candidates and $MET > 25\rm~GeV$. Relative abundances for events in which the denominator muon passes selection are determined by fitting the resulting MET distribution with a same-sign MC template for $W+jets$ and a Rayleigh distribution for multi-jets. The fit result (Figure~\ref{fig:ssMuon}, left) indicates that $W+jets$ constitutes $\sim80\%$ of the sample. Residual contributions from heavy flavor in the same-sign muon sample are therefore small.
137
138			%-------------------------------------------------
139			\begin{figure}[htb]
140			\begin{center}
141			\includegraphics[width=0.45\linewidth]{figs/ssMuMET.png}
142			\includegraphics[width=0.45\linewidth]{figs/ssMuMZ1.png}
143			\caption{Fakerate Predictions for Same-sign Muon Events.}\label{fig:ssMuon}
144			\end{center}
145			\end{figure}
146			%-------------------------------------------------
147
148			Next, we attempt to predict the number of events containing two identified and isolated same-sign muons by applying our fakerates to denominator objects that fail selection. We loop over all such objects, weight each with the appropriate factor of $\epsilon_{FR}(p_{T},\eta)/(1-\epsilon_{FR}(p_{T},\eta))$ and sum. The expected and observed $m(\ell\ell)$ distributions are shown in the rightmost plot of Figure~\ref{fig:ssMuon}. The shape of the predicted distribution agrees with the observation, however the yield is under-predicted by $47.2\%$.

%This difference can be understood as a result of differences in the composition of the prediction sample (mainly light flavor) and that used to measure the fakerate (a mix of light and heavy flavor).

{\bf For electrons ...}
%For electrons, charge misidentification is significant enough to result in a noticeable Z-peak. The jet background is however easily estimated from a fit with a same-sign MC Z template and an exponential background PDF. Events selected in data are shown in Figures~\ref{fig:ssMuon} and \ref{fig:ssEle} as points. Table~\ref{tab:ssfakes} lists the total number of observed events in the muon-channel and the electron-channel background determined from the fit.

%-------------------------------------------------
\begin{figure}[htb]
\begin{center}
\includegraphics[width=0.45\linewidth]{figs/ssMuMET}
\includegraphics[width=0.45\linewidth]{figs/ssMuMZ1}
\caption{ Fakerate Predictions for Same-sign Electron Events. {\bf Plots are currently for muons ... }}
\label{fig:ssEle}
\end{center}
\end{figure}
%-------------------------------------------------

Table~\ref{tab:ssfakes} summarizes the results of this section. We take $47.2\%$ ($X\%$) as the systematic uncertainty on the muon (electron) fakerate to account for potential biases in our prediction due to differences in light flavor composition.

%-------------------------------------------------
\begin{table}[tbh]
\begin{center}
\begin{tabular}{c|c|c|c}
\hline
channel & observed & predicted & systematic \\
\hline
${\rm same~sign}~\mu\mu$ & $159$ & $108.04$ & $47.2\%$\\
${\rm same~sign}~ee$ & $X$ & $Y$ & $Z\%$ \\
\hline
\end{tabular}
\caption{Same-sign Control Yields and Systematic}
\label{tab:ssfakes}
\end{center}
\end{table}
%-------------------------------------------------
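
As an arithmetic check on the muon row of Table~\ref{tab:ssfakes}, the quoted systematic is the difference between the observed and predicted yields taken relative to the prediction. The symbols $N^{obs}$ and $N^{pred}$ are notation assumed here.

\begin{eqnarray}
\frac{N^{obs}-N^{pred}}{N^{pred}} & = & \frac{159-108.04}{108.04} \approx 0.472
\end{eqnarray}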

%_________________________________________________________________
\subsubsection{Cross Check and Systematics: Heavy Flavor}\label{sec:hflavor}
%_________________________________________________________________
Backgrounds from $t\bar{t}$ and $Zb\bar{b}/c\bar{c}$ involve real leptons from heavy flavor decays. As with light flavor, a difference in the fraction of heavy flavor between the fakerate and prediction samples can bias the background estimate in the signal region. We assess the impact of such composition differences by applying our fakerate in a relatively pure sample of $Zb\bar{b}/c\bar{c}$ and $t\bar{t}$ events.

The control region consists of events that contain a pair of leptons passing the $Z1$ selection and at least two additional denominator objects with $\sigma(IP_{3D})/IP_{3D} > 4$. Denominators are defined according to the requirements of Table~\ref{tab:fo}. We make no requirement on denominator charge or flavor. The leftmost plot of Figure~\ref{fig:ZHF} compares the observed $m(Z1)$ distribution for events passing this selection in data with cross-section-normalized predictions from MC. We observe $71$ events and predict $66.3 \pm 2.0$ with $Zb\bar{b}$ and $t\bar{t}$ MC, which confirms that the data sample is indeed dominated by heavy flavor. {\bf update numbers, they're like 62 and 58 now ...}

Next, we require the high-IP denominator objects to additionally pass the more stringent lepton ID and isolation criteria used in our nominal $Z2$ selection. We estimate $0.81 \pm 0.21$ events from MC and observe $2$. We then apply the electron and muon fakerates to the denominator objects in the original $71$ events and, following the procedures described in Section~\ref{sec:lflavor}, predict $0.84 \pm 0.10$ events. Given the consistent results, we assign no additional systematic uncertainty on our predicted $\ell\ell jj$ background yields.
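
The level of agreement can be quantified with a rough check, sketched here under two assumptions that are not part of the procedure above: that the MC and fakerate uncertainties are uncorrelated, and that the observed count is Poisson-distributed about the fakerate prediction.

\begin{eqnarray}
\frac{0.84-0.81}{\sqrt{0.21^{2}+0.10^{2}}} & \approx & 0.13 \\
P(n\geq2\,|\,\mu=0.84) & = & 1-e^{-0.84}\,(1+0.84) \approx 0.21
\end{eqnarray}

Under these assumptions the two predictions differ by about $0.13$ standard deviations, and an observation of two or more events is expected roughly one time in five.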

%We then reinstate the $\sigma_{IP_{3D}}/IP_{3D} < 4$ cut and estimate $2.5 \pm 0.4$ events in the signal region from the $Zb\bar{b}$ and $t\bar{t}$ MC. We take this prediction as an estimate of the heavy flavor contribution to our overall $\ell\ell jj$ background esimtate of $XXX$. We assign a s sysmatic uncertainty on the estimated fraction Considering the We assignconsider th

%-------------------------------------------------
\begin{figure}[htbp]
\begin{center}
\includegraphics[width=0.45\linewidth]{figs/HFmZ1.png}
\includegraphics[width=0.45\linewidth]{figs/HFm4l.png}
\caption{$m(Z1)$ and $m(4\ell)$ in the Heavy Flavor control region.}\label{fig:ZHF}
\end{center}
\end{figure}
%-------------------------------------------------

%We determine a shape for heavy flavor background in the signal region from the distribution of $m(4\ell)$ from the $Z1 + 2\times$ denominator events. The rightmost plot of Figure~\ref{fig:ZHF} compares the $m(4\ell)$ distributions for this selection in data and (cross-section normalized) simulation. We fit both distributions with Landaus and compare the normalized PDFs in Figure~\ref{fig:HFshape}.

%-------------------------------------------------
%\begin{figure}[htbp]
%\begin{center}
%\includegraphics[width=0.5\linewidth]{figs/HFshape.png}
%\caption{$m(4\ell)$ shapes in the Heavy Flavor control region.}\label{fig:HFshape}
%\end{center}
%\end{figure}
%-------------------------------------------------
%_________________________________________________________________
\subsection{Cross Check and Systematics: $WZ$}
%_________________________________________________________________
The estimate of $WZ$ background in Table~\ref{tab:MCBG} is entirely MC-based. Beyond the three leptons from the $W$ and $Z$ decays, a ``fake'' lepton is needed for this process to contribute in the $4\ell$ signal region. We cross-check the MC prediction with an estimate obtained from the fakeable object method.

We begin by requiring three fully selected leptons (two from the $Z1$ plus one additional) and at least one denominator object. We then perform a single loop to associate the denominator objects with the third lepton. As before, we weight the denominators with $\epsilon_{FR}(p_{T},\eta)/(1-\epsilon_{FR}(p_{T},\eta))$, apply opposite-sign, same-flavor and kinematic selections, and sum. The additional identified lepton with which the denominators are paired is either a fake (from $Z+jets$) or a real lepton (from $WZ$, or from $ZZ$ where one of the leptons is not reconstructed). To extract the $WZ$ component of the measurement, we subtract the $3\ell$ contribution predicted by MC for $ZZ$ as well as the double-fake estimate described in Section~\ref{sec:fakes}. The latter is double-counted when performing a single denominator loop.

\begin{eqnarray}
N(WZ) &=& \ell\ell\ell~\sum_{i=1}^{N_{d}}~\frac{\epsilon_{FR}(p_{T}^{i},\eta^{i})}{1-\epsilon_{FR}(p_{T}^{i},\eta^{i})} \\
~ &-& 2\times \ell\ell~\sum_{i=1}^{N_{d}}\sum_{j=i+1}^{N_{d}}~\frac{\epsilon_{FR}(p_{T}^{i},\eta^{i})}{1-\epsilon_{FR}(p_{T}^{i},\eta^{i})}~\frac{\epsilon_{FR}(p_{T}^{j},\eta^{j})}{1-\epsilon_{FR}(p_{T}^{j},\eta^{j})} \\
~ &-& N(ZZ)
\end{eqnarray}

Table~\ref{tab:WZfake} lists values for the terms in the equation above. The result ... {\bf fill in the table and quote a systematic for WZ}.
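
The factor of two in the double-fake term can be motivated with a short sketch. Consider a $Z+jets$ event with two denominator objects $a$ and $b$ whose fakerates $\epsilon_{a}$ and $\epsilon_{b}$ are assumed to be independent; these symbols are notation introduced here for illustration. The single loop picks up the event when exactly one object passes selection and weights the failing one (first line), whereas the double-fake estimate weights events in which both objects fail (second line).

\begin{eqnarray}
\epsilon_{a}(1-\epsilon_{b})\frac{\epsilon_{b}}{1-\epsilon_{b}} + \epsilon_{b}(1-\epsilon_{a})\frac{\epsilon_{a}}{1-\epsilon_{a}} & = & 2\,\epsilon_{a}\epsilon_{b} \\
(1-\epsilon_{a})(1-\epsilon_{b})\frac{\epsilon_{a}}{1-\epsilon_{a}}\frac{\epsilon_{b}}{1-\epsilon_{b}} & = & \epsilon_{a}\epsilon_{b}
\end{eqnarray}

Under this assumption the single loop yields twice the double-fake estimate for such events, which is why that estimate enters the subtraction with a factor of two.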

%-------------------------------------------------
\begin{table}[tbh]
\begin{center}
\begin{tabular}{|c|c|c|}
\hline
$4e$ & $4\mu$ & $2e2\mu$ \\
\hline
$X\pm Y$ & $Z\pm Y$ & $Z\pm Y$ \\
\hline
\end{tabular}
\caption{Data-driven Expected $WZ$ Yields}
\label{tab:WZfake}
\end{center}
\end{table}
%-------------------------------------------------