ViewVC Help
View File | Revision Log | Show Annotations | Root Listing
root/cvsroot/UserCode/MitHzz4l/Documentation/Backgrounds.tex
Revision: 1.3
Committed: Tue Nov 8 11:18:58 2011 UTC (13 years, 6 months ago) by khahn
Content type: application/x-tex
Branch: MAIN
Changes since 1.2: +64 -94 lines
Log Message:
stuff corresponding to rough draft

File Contents

# User Rev Content
1 khahn 1.1 %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
2 khahn 1.3 \section{Backgrounds}\label{sec:BG}
3 khahn 1.1 %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
4     This section reviews our evaluation of background in the $4\ell$ analysis. We discuss expected yields and the predicted $m(4\ell)$ shapes, both of which are used in the limit and sensitivity calculations described in Section~\ref{sec:Extraction}. We estimate Electroweak (EWK) backgrounds with Monte Carlo. Our estimates of instrumental and jet backgrounds are data-driven.
5    
6     %_________________________________________________________________
7     \subsection{Electroweak Backgrounds}\label{sec:EWK}
8     %_________________________________________________________________
9 khahn 1.3 We use the $ZZ$, $WZ$ and $Z\gamma$ MC samples listed in Table~\ref{tab:MC} to estimate yields and $m(4\ell)$ shapes for these backgrounds. We correct the acceptances determined from simulation using the procedures described in Section~\ref{sec:Signal}. We determine background yields using the corrected $4e$, $4\mu$ and $2e2\mu$ acceptances ($\alpha_{c}$) for each process:
10 khahn 1.1
11     \begin{eqnarray}
12 khahn 1.3 N^{exp}_{i} & = & \alpha^{c}_{i}\int\mathcal{L}\sigma_{i}
13 khahn 1.1 \end{eqnarray}
14    
15 khahn 1.3 The cross sections used in formula above are taken from Table~\ref{tab:MC}. Table~\ref{tab:MCBG} lists the $\alpha_{c}$ and the expected $2.1\rm~fb^{-1}$ yields for the diboson backgrounds. Figure~\ref{fig:MCshapes} shows a yield-normalized stack of the corresponding $m(4\ell)$ distributions.
16    
17     %-------------------------------------------------
18     \begin{figure}[htb]
19     \begin{center}
20     \includegraphics[width=0.5\linewidth]{figs/HF1.png}
21     \caption{MC Background Shapes.{\bf put the right plot here} }
22     \label{fig:MCshapes}
23     \end{center}
24     \end{figure}
25     %-------------------------------------------------
26 khahn 1.1
27     %-------------------------------------------------
28     \begin{table}[htb]
29     \begin{center}
30     \begin{tabular}{c|cc|cc|cc}
31 khahn 1.3 {\bf Process} & $\alpha^{c}_{ee}$ & $N^{exp}_{ee}$ & $\alpha^{c}_{\mu\mu}$ & $N^{exp}_{\mu\mu}$ & $\alpha^{c}_{2e2\mu}$ & $N^{exp}_{2e2\mu}$ \\
32 khahn 1.1 \hline
33 khahn 1.3 $ZZ^{*}$ & ~ & ~ & ~ & ~ & ~ & ~ \\
34 khahn 1.1 $WZ$ & ~ & ~ & ~ & ~ & ~ & ~ \\
35     $Z\gamma$ & ~ & ~ & ~ & ~ & ~ & ~ \\
36     \hline
37     \end{tabular}
38 khahn 1.3 \caption{MC Background Yields.}
39     \label{tab:MCBG}
40 khahn 1.1 \end{center}
41     \end{table}
42     %-------------------------------------------------
43    
44 khahn 1.3 We consider two sources of systematic uncertainties on the EWK background predictions. The first is due to the uncertainty on the efficiency scale-factors, which we propagate from the tables of Section~\ref{sec:Leptons} to the corrected acceptance for each channel. {\bf still have to do this}. The second uncertainty concerns the influence of missing higher-orders on the mass shapes and kinematics predicted by the MC. We estimate the magnitude of this effect by reweighting the POWHEG samples at generator-level to the $m(4\ell)$ distributions predicted by MCFM with renormalization and factorization scales varied by $\times 2$, $/2$. We take the relative differences in shape are used as an uncertainty in the limit calculation. Figure~\ref{fig:EWKshapeSys} shows the relative shape differences we obtain after reweighting. {\bf done for ZZ, needed for the others}.
45 khahn 1.1
46     %-------------------------------------------------
47     \begin{figure}[bht]
48     \begin{center}
49     \includegraphics[width=0.5\linewidth]{figs/HF1.png}
50 khahn 1.3 \caption{EWK Shape Differences From MCFM Reweight.{\bf get the right plot in here} }
51     \label{fig:EWKshapeSys}
52 khahn 1.1 \end{center}
53     \end{figure}
54     %-------------------------------------------------
55    
56     %_________________________________________________________________
57     \subsection{Instrumental/Fake Backgrounds}\label{sec:fakes}
58     %_________________________________________________________________
59 khahn 1.3 $Z+jets$ , $Zb\bar{b}/c\bar{c}$ and $t\bar{t}$ backgrounds (collectively, $\ell\ell jj$) contribute to the $4\ell$ signal region when jets in these events are either mismeasured as leptons or produce real leptons through secondary interactions. These processes are difficult to accurately simulate so we estimate their contribution from data. We assess $\ell\ell jj$ backgrounds using the ``fakeable object'' technique~\cite{fakeable}. For this method we define ``fakerates'' with respect to loosely identified lepton candidates, referred to as ``denominator objects''. Electron and muon denominator selections are defined in Table~\ref{tab:fo}.
60 khahn 1.1
61     %-------------------------------------------------
62     \begin{table}[htb]
63     \begin{center}
64 khahn 1.3 \begin{tabular}{c|c|c|c}
65     \multicolumn{2}{c|}{Electron} & \multicolumn{2}{|c}{Muon} \\
66 khahn 1.1 \hline
67     variable & requirement & variable & requirement \\
68     \hline
69 khahn 1.3 ~ & ~ & $p_{T}$ & $> 5\rm~GeV$ \\
70     ~ & ~ & type & $\rm Global~||~Tracker$ \\
71     ~ & ~ & $|d_{0}|$ & $< 2\rm~mm$ \\
72     ~ & ~ & $Iso^{pf}_{0.3}$ & $< 3\times p_{T}$ \\
73 khahn 1.1 \end{tabular}
74     \caption{Denominator Object Definitions}\label{tab:fo}
75     \end{center}
76     \end{table}
77     %-------------------------------------------------
78    
79     We calculate the fakerates ($\epsilon_{FR}(p_{T},\eta)$) from samples of events that pass single lepton triggers: \verb|HLT_Ele8| for electrons, \verb|HLT_Mu8| or \verb|HLT_Mu13| for muons. In both channels we reduce contamination from $W\rightarrow \ell\nu$ and $Z/\gamma^{*}\rightarrow\ell\ell$ by vetoing events with $MET > 20\rm~GeV$, or with $m_{T} > 35\rm~GeV$ or with two or more denominator objects of $p_{T} > 10\rm~GeV$. We enrich the samples in background by selecting only those denominator objects opposite to ($\Delta R(\eta,\phi) > 1.0$) a reconstructed $p_{T} > 35\rm~GeV$ jet. Figure~\ref{fig:FR} shows the electron and muon fakerates obtained from this procedure as a function of $p_{T}$.
80    
81     %-------------------------------------------------
82     \begin{figure}[tbp]
83     \begin{center}
84     \includegraphics[width=0.45\linewidth]{figs/frMu.png}
85 khahn 1.3 \includegraphics[width=0.45\linewidth]{figs/frMu.png}
86     \caption{ Muon and Electron Fake Rates.}
87     \label{fig:FR}
88 khahn 1.1 \end{center}
89     \end{figure}
90     %-------------------------------------------------
91    
92     We estimate $\ell\ell jj$ backgrounds in the signal region by applying the fakerates in events that contain a good Z1. First, we select denominator objects that fail identification/isolation to prevent bias from real leptons. Next, we loop over pairs of the denominator objects, weight each leg with $\epsilon_{FR}(p_{T},\eta)/(1-\epsilon_{FR}(p_{T},\eta))$ and apply the Z2 kinematic requirements ($12\rm~GeV < m(Z2) < 120$). The denominator in the weight term accounts for the fact that the we only consider candidates that fail full lepton selection. Weighted pairs that pass the Z2 kinematic selection are summed to obtain an estimate of the $\ell\ell jj$ background.
93    
94 khahn 1.3 Table~\ref{tab:fakes} presents $\ell\ell jj$ background estimates for the $2.1\rm~fb^{-1}$ dataset. We maximize the statistical power of the small $Z1 + \ge 2\rm~denominator$ sample by integrating over the flavor of the $Z1$ leptons and then dividing the $Z1$-inclusive prediction between the $4\ell_{e,\mu}$ and $2\ell_{e,\mu}2\ell_{\mu,e}$ channels. The division is performed by assuming equal $ee$ and $\mu\mu$ $Z1$ branching ratios and using an acceptance factor ($=\sim1$, measured from inclusive $Z\rightarrow ee,\mu\mu$ yields {\bf need to double check this. 1 seems strange}) to account for efficiency differences in the detection of electrons and muons.
95 khahn 1.1
96     %-------------------------------------------------
97     \begin{table}[htb]
98     \begin{center}
99     \begin{tabular}{c|c}
100     \hline
101     \multicolumn{2}{c}{Z1-Inclusive $\ell\ell jj$ Yields} \\
102     \hline
103     $Z1 + \mu\mu$ & $0.057 \pm X$ \\
104     $Z1 + ee$ & $X \pm Y$ \\
105     \hline
106     \multicolumn{2}{c}{Per-Channel $\ell\ell jj$ Yields} \\
107     \hline
108     $4\mu$ & $0.044 \pm X$ \\
109     $4e$ & $X \pm Y$ \\
110     $2e2\mu$ & $(0.013 + Z) \pm Y$ \\
111     \hline
112     \end{tabular}
113 khahn 1.3 \caption{Expected $\ell\ell jj$ Events.}
114     \label{tab:fakes}
115 khahn 1.1 \end{center}
116     \end{table}
117     %-------------------------------------------------
118    
119 khahn 1.3 It is difficult to predict $m(4\ell)$ and kinematic shapes for $\ell\ell jj$ background with the limited number of events containing a good $Z1$ and two failing denominator objects. Although loosening the denominator and $Z1,2$ selections helps, these requirements must not be made so loose that distributions from the control region no longer resemble those of the signal region. As an alternative, we study shapes using $Z+jets$, $t\bar{t}$ and $Zb\bar{b}$ MC events that pass our nominal selections. Figure~\ref{fig:fakeshapes}, for example, shows the cross section normalized $m(4\ell)$ distributions from these processes.
120 khahn 1.1
121     %-------------------------------------------------
122     \begin{figure}[tbp]
123     \begin{center}
124     \includegraphics[width=0.45\linewidth]{figs/muFakeShape-4m.png}
125 khahn 1.3 \caption{ Predicted $m(4\ell)$ Distributions for $\ell\ell jj$ Events. {\bf this is an old data-driven plot. Put the MC one here.}}
126     \label{fig:fakeshapes}
127 khahn 1.1 \end{center}
128     \end{figure}
129     %-------------------------------------------------
130    
131     %_________________________________________________________________
132     \subsubsection{Cross Check and Systematics: Light Flavor }\label{sec:lflavor}
133     %_________________________________________________________________
134     We cross-check our procedures by predicting the number of fake leptons in independent control regions enriched in light flavor. We require one $p_{T} > 25\rm~GeV$ lepton candidate that passes our nominal lepton selection and $1+$ same-sign, same-flavor denominator objects. We veto events with $m(\ell\ell)$ between $76-106\rm~GeV$ to reduce real lepton contamination from Z decays.
135    
136     In the muon-channel this selection produces a sample of pure background, of which the primary component is $W+jet$ with a jet faking a muon. The smaller multi-jet backgrounds, consisting of both light and heavy flavor, contain at least two jets that both fake muons. We reduce the heavy flavor contribution in this sample by requiring $|\sigma(IP_{3D})/IP_{3D} < 3|$ for all muon candidates and $MET > 25\rm~GeV$. Relative abundances for events in which the denominator muon passes selection are determined by fitting the resulting MET distribution with a same-sign MC template for $W+jets$ and a Rayleigh distribution for multi-jets. The fit result (Figure~\ref{fig:ssMuon}, left) indicates that $W+jets$ constitutes $\sim80\%$ of the sample. Residual contributions from heavy flavor in the same-sign muon sample are therefore small.
137    
138     %-------------------------------------------------
139     \begin{figure}[htb]
140     \begin{center}
141     \includegraphics[width=0.45\linewidth]{figs/ssMuMET.png}
142     \includegraphics[width=0.45\linewidth]{figs/ssMuMZ1.png}
143     \caption{Fakerate Predictions for Same-sign Muon Events.}\label{fig:ssMuon}
144     \end{center}
145     \end{figure}
146     %-------------------------------------------------
147    
148     Next, we attempt to predict the number of events containing two identified and isolated same-sign muons by applying our fakerates to denominator objects that fail selection. We loop over all such objects, weight each with the appropriate factor of $\epsilon_{FR}(p_{T},\eta)/(1-\epsilon_{FR}(p_{T},\eta))$ and sum. The expected and observed $m(\ell\ell)$ distributions are shown in the rightmost plot of Figure~\ref{fig:ssMuon}. The shape of the predicted distribution agrees with the observation, however the yield is under-predicted by $47.2\%$.
149    
150     %This difference can be understood as a result of differences in the composition of the prediction sample (mainly light flavor) and that used to measure the fakerate (a mix of light and heavy flavor).
151    
152 khahn 1.3 {\bf For electrons ...}
153 khahn 1.1 %For electrons, charge misidentification is significant enough to result in a noticible Z-peak. The jet background is however easily estimated from a fit with a same-sign MC Z template and an exponential background PDF. Events selected in data are shown in Figures~\ref{fig:ssMuon} and (\ref{fig:ssEle}) as points. Table~\ref{tab:ssfakes} lists the total number of observed events in the muon-channel and the electron-channel background determined from the fit.
154    
155     %-------------------------------------------------
156     \begin{figure}[htb]
157     \begin{center}
158 khahn 1.3 \includegraphics[width=0.45\linewidth]{figs/ssMuMET}
159     \includegraphics[width=0.45\linewidth]{figs/ssMuMZ1}
160     \caption{ Fakerate Predictions for Same-sign Electron Events. {\bf Plots are currently for muons ... }}
161     \label{fig:ssEle}
162 khahn 1.1 \end{center}
163     \end{figure}
164     %-------------------------------------------------
165    
166     Table summarizes the results of this section. We take $47.2\%$ ($X\%$) as the systematic uncertainty on the muon (electron) fakerate to account for potential biases in our prediction due to differences in light flavor composition.
167    
168     %-------------------------------------------------
169     \begin{table}[tbh]
170     \begin{center}
171 khahn 1.3 \begin{tabular}{c|c|c|c}
172 khahn 1.1 \hline
173     channel & observed & predicted & systematic \\
174     \hline
175 khahn 1.3 ${\rm same~sign} \mu\mu$ & $159$ & $108.04$ & $47.2\%$\\
176     ${\rm same~sign} ee$ & $X$ & $Y$ & $Z\%$ \\
177 khahn 1.1 \hline
178     \end{tabular}
179 khahn 1.3 \caption{Same-sign Control Yields and Systematic}
180     \label{tab:ssfakes}
181 khahn 1.1 \end{center}
182     \end{table}
183     %-------------------------------------------------
184    
185     %_________________________________________________________________
186     \subsubsection{Cross Check and Systematics : Heavy Flavor }\label{sec:hflavor}
187     %_________________________________________________________________
188 khahn 1.3 Backgrounds from $t\bar{t}$ and $Zb\bar{b}/c\bar{c}$ involve real leptons from heavy flavor decays. As with light flavor, a difference in the fraction of heavy flavor in the fakerate and prediction samples can lead to errors in signal region background estimation. We assess the impact of heavy flavor composition differences by applying our fakerate in a sample of relatively pure $Zb\bar{b}/c\bar{c}$ and $t\bar{t}$.
189 khahn 1.1
190 khahn 1.3 The control region consists of events that contain a pair of leptons passing the $Z1$ selection and at least two additional denominator objects with $\sigma(IP_{3D})/IP_{3D} > 4$. Denominators are defined according to the requirements of Table~\ref{tab:fo}. We make no requirement on denominator charge or flavor. The leftmost plot of Figure~\ref{fig:ZHF} compares the observed $m(Z1)$ distributions for events passing this selection in data with cross section normalized predictions from MC. We observe $71$ events and predict $66.3 \pm 2.0$ with $Zb\bar{b}$ and $t\bar{t}$ MC, which confirms that the data sample is indeed dominated by heavy flavor. {\bf update numbers, they're like 62 and 58 now ...}
191 khahn 1.1
192     Next, we require the high-IP denominator objects to additionally pass the more stringent lepton ID and isolation criteria used in our nominal Z2 selection. We estimate $0.81 \pm 0.21$ events from MC and observe 2. Electron and muon fakerates are then applied to the denominator objects in the original $71$ events and, following the procedures described in Section~\ref{sec:lflavor}, we predict $0.84 \pm 0.10$ events. Given the consistent results, we assign no additional systematic uncertainty on our predicted $\ell\ell jj$ background yields.
193    
194     %We then reinstate the $\sigma_{IP_{3D}}/IP_{3D} < 4$ cut and estimate $2.5 \pm 0.4$ events in the signal region from the $Zb\bar{b}$ and $t\bar{t}$ MC. We take this prediction as an estimate of the heavy flavor contribution to our overall $\ell\ell jj$ background esimtate of $XXX$. We assign a s sysmatic uncertainty on the estimated fraction Considering the We assignconsider th
195    
196     %-------------------------------------------------
197     \begin{figure}[htbp]
198     \begin{center}
199     \includegraphics[width=0.45\linewidth]{figs/HFmZ1.png}
200     \includegraphics[width=0.45\linewidth]{figs/HFm4l.png}
201     \caption{$m(Z1)$ and $m(4\ell)$ in the Heavy Flavor control region.}\label{fig:ZHF}
202     \end{center}
203     \end{figure}
204     %-------------------------------------------------
205    
206 khahn 1.3 %We determine a shape for heavy flavor background in the signal region from the distribution of $m(4\ell)$ from the $Z1 + 2\times$ denominator events. The rightmost plot of Figure~\ref{fig:ZHF} compares the $m(4\ell)$ distributions for this selection in data and (cross-section normalized) simulation. We fit both distributions with Landaus and compare the normalized PDFs in Figure~\ref{fig:HFshape}.
207 khahn 1.1
208     %-------------------------------------------------
209 khahn 1.3 %\begin{figure}[htbp]
210     %\begin{center}
211     %\includegraphics[width=0.5\linewidth]{figs/HFshape.png}
212     %\caption{$m(4\ell)$ shapes in the Heavy Flavor control region.}\label{fig:HFshape}
213     %\end{center}
214     %\end{figure}
215 khahn 1.1 %-------------------------------------------------
216     %_________________________________________________________________
217     \subsection{Cross Check and Systematics: $WZ$ }
218     %_________________________________________________________________
219 khahn 1.3 The estimate of $WZ$ background in Table~\ref{tab:MCBG} is entirely MC-based. In addition to the leptons from $W$ and $Z$ decay, an additional ``fake'' lepton is needed for this process to contribute in the $4\ell$ signal region. We cross-check MC predictions with an estimate obtained from the fakeable object method.
220 khahn 1.1
221 khahn 1.3 We begin by requiring three fully selected leptons (two from the Z1 plus one additional) and $1+$ denominator objects. We then perform a single loop to associate the denominator objects with the third lepton. As before, we weight the denominators with $\epsilon_{FR}(p_{T},\eta)/(1-\epsilon_{FR}(p_{T},\eta))$, apply opposite-sign, same-flavor and kinematic selections and sum. The additional, identified lepton with which the denominators are paired is either a fake (from $Z+jets$) or a real lepton (from $WZ$ or $ZZ$ where one of the leptons is not reconstructed). In order to extract the $WZ$ component of the measurement, we need to subtract off the $3\ell$ contribution predicted by MC for $ZZ$ as well as the double-fake estimate described in Section~\ref{sec:fakes}. The latter is double-counted when performing a single denominator loop.
222 khahn 1.1
223     \begin{eqnarray}
224 khahn 1.3 N(WZ) &=& \ell\ell\ell~\Sigma_{i=0}^{Nd}~\frac{\epsilon(\eta^{i},p_{T}^{i})}{1-\epsilon(\eta^{i},p_{T}^{i})} \\
225     ~ &-& 2\times \ell\ell~\Sigma_{i=0}^{Nd}\Sigma_{j=i+1}^{Nd}~\frac{\epsilon(\eta^{i},p_{T}^{i})}{1-\epsilon(\eta^{i},p_{T}^{i})}~\frac{\epsilon(\eta^{j},p_{T}^{j})}{1-\epsilon(\eta^{j},p_{T}^{j})} \\
226     ~ &-& N(WZ)
227 khahn 1.1 \end{eqnarray}
228    
229 khahn 1.3 Table~\ref{tab:WZfake} lists values for the terms in the equation above. The result ... {\bf fill in the table and quote a systematic for WZ}.
230 khahn 1.1
231     %-------------------------------------------------
232     \begin{table}[tbh]
233     \begin{center}
234     \begin{tabular}{|c|c|c|}
235     \hline
236     $4e$ & $4\mu$ & $2e2\mu$ \\
237     \hline
238     $X\pm Y$ & $Z\pm Y$ & $Z\pm Y$ \\
239     \hline
240     \end{tabular}
241 khahn 1.3 \caption{Data-driven Expected $WZ$ Yields}
242     \label{tab:WZfake}
243 khahn 1.1 \end{center}
244     \end{table}
245     %-------------------------------------------------
246    
247