12 |
|
from $W$-decays, which is reconstructed as \met in the |
13 |
|
detector. |
14 |
|
|
15 |
< |
in 30 pb$^{-1}$ we expect $\approx$ 1 SM event in |
15 |
> |
In 30 pb$^{-1}$ we expect $\approx$ 1 SM event in |
16 |
|
the signal region. The expectations from the LMO |
17 |
< |
and LM1 SUSY benchmark points are {\color{red} XX} and |
18 |
< |
{\color{red} XX} events respectively. |
17 |
> |
and LM1 SUSY benchmark points are 15.1 and |
18 |
> |
6.0 events respectively. {\color{red} I took these |
19 |
> |
numbers from the twiki, rescaling from 11.06 to 30/pb. |
20 |
> |
They seem too large...are they really right?} |
21 |
|
|
22 |
|
|
23 |
|
\subsection{ABCD method} |
29 |
|
Thus, we can use an ABCD method in the \met$/\sqrt{\rm SumJetPt}$ vs |
30 |
|
sumJetPt plane to estimate the background in a data driven way. |
31 |
|
|
32 |
< |
\begin{figure}[htb] |
32 |
> |
\begin{figure}[tb] |
33 |
|
\begin{center} |
34 |
|
\includegraphics[width=0.75\linewidth]{uncorrelated.pdf} |
35 |
|
\caption{\label{fig:uncor}\protect Distributions of SumJetPt |
38 |
|
\end{center} |
39 |
|
\end{figure} |
40 |
|
|
41 |
< |
\begin{figure}[htb] |
41 |
> |
\begin{figure}[bt] |
42 |
|
\begin{center} |
43 |
< |
\includegraphics[width=0.75\linewidth]{abcdMC.jpg} |
43 |
> |
\includegraphics[width=0.5\linewidth, angle=90]{abcdMC.pdf} |
44 |
|
\caption{\label{fig:abcdMC}\protect Distributions of SumJetPt |
45 |
|
vs. MET$/\sqrt{\rm SumJetPt}$ for SM Monte Carlo. Here we also |
46 |
< |
show our choice of ABCD regions. {\color{red} We need a better |
47 |
< |
picture with the letters A-B-C-D and with the numerical values |
46 |
< |
of the boundaries clearly indicated.}} |
46 |
> |
show our choice of ABCD regions. {\color{red} Derek, I |
47 |
> |
do not know if this is SM or $t\bar{t}$ only.}} |
48 |
|
\end{center} |
49 |
|
\end{figure} |
50 |
|
|
52 |
|
Our choice of ABCD regions is shown in Figure~\ref{fig:abcdMC}. |
53 |
|
The signal region is region D. The expected number of events |
54 |
|
in the four regions for the SM Monte Carlo, as well as the BG |
55 |
< |
prediction AC/B is given in Table~\ref{tab:abcdMC} for an integrated |
55 |
> |
prediction AC/B are given in Table~\ref{tab:abcdMC} for an integrated |
56 |
|
luminosity of 30 pb$^{-1}$. The ABCD method is accurate |
57 |
< |
to about 10\%. |
57 |
> |
to about 10\%. {\color{red} Avi wants some statement about stability |
58 |
> |
wrt changes in regions. I am not sure that we have done it and |
59 |
> |
I am not sure it is necessary (Claudio).} |
60 |
|
|
61 |
|
\begin{table}[htb] |
62 |
|
\begin{center} |
65 |
|
\begin{tabular}{|l|c|c|c|c||c|} |
66 |
|
\hline |
67 |
|
Sample & A & B & C & D & AC/D \\ \hline |
68 |
< |
ttdil & 6.4 & 28.4 & 4.2 & 1.0 & 0.9 \\ |
69 |
< |
Zjets & 0.0 & 1.3 & 0.2 & 0.0 & 0.0 \\ |
70 |
< |
Other SM & 0.6 & 2.1 & 0.2 & 0.1 & 0.0 \\ \hline |
71 |
< |
total MC & 7.0 & 31.8 & 4.5 & 1.1 & 1.0 \\ \hline |
68 |
> |
ttdil & 6.9 & 28.6 & 4.2 & 1.0 & 1.0 \\ |
69 |
> |
Zjets & 0.0 & 1.3 & 0.1 & 0.1 & 0.0 \\ |
70 |
> |
Other SM & 0.5 & 2.0 & 0.1 & 0.1 & 0.0 \\ \hline |
71 |
> |
total MC & 7.4 & 31.9 & 4.4 & 1.2 & 1.0 \\ \hline |
72 |
|
\end{tabular} |
73 |
|
\end{center} |
74 |
|
\end{table} |
75 |
|
|
76 |
+ |
\subsection{Dilepton $P_T$ method} |
77 |
+ |
\label{sec:victory} |
78 |
+ |
This method is based on a suggestion by V. Pavlunin\cite{ref:victory}, |
79 |
+ |
and was investigated by our group in 2009\cite{ref:ourvictory}. |
80 |
+ |
The idea is that in dilepton $t\bar{t}$ events the lepton and neutrinos |
81 |
+ |
from $W$ decays have the same $P_T$ spectrum (modulo $W$ polarization |
82 |
+ |
effects). One can then use the observed |
83 |
+ |
$P_T(\ell\ell)$ distribution to model the sum of neutrino $P_T$'s which |
84 |
+ |
is identified with the \met. |
85 |
+ |
|
86 |
+ |
Then, in order to predict the $t\bar{t} \to$ dilepton contribution to a |
87 |
+ |
selection with \met$+$X, one applies a cut on $P_T(\ell\ell)+$X instead. |
88 |
+ |
In practice one has to rescale the result of the $P_T(\ell\ell)+$X selection |
89 |
+ |
to account for the fact that any dilepton selection must include a |
90 |
+ |
moderate \met cut in order to reduce Drell Yan backgrounds. This |
91 |
+ |
is discussed in Section 5.3 of Reference~\cite{ref:ourvictory}; for a \met |
92 |
+ |
cut of 50 GeV, the rescaling factor is obtained from the data as |
93 |
+ |
|
94 |
+ |
\newcommand{\ptll} {\ensuremath{P_T(\ell\ell)}} |
95 |
+ |
\begin{center} |
96 |
+ |
$ K = \frac{\int_0^{\infty} {\cal N}(\ptll)~~d\ptll~}{\int_{50}^{\infty} {\cal N}(\ptll)~~d\ptll~}$ |
97 |
+ |
\end{center} |
98 |
+ |
|
99 |
+ |
|
100 |
+ |
Monte Carlo studies give values of $K$ that are typically between 1.5 and 1.6, |
101 |
+ |
depending on selection details. |
102 |
+ |
|
103 |
+ |
There are several effects that spoil the correspondance between \met and |
104 |
+ |
$P_T(\ell\ell)$: |
105 |
+ |
\begin{itemize} |
106 |
+ |
\item $Ws$ in top events are polarized. Neutrinos are emitted preferentially |
107 |
+ |
forward in the $W$ rest frame, thus the $P_T(\nu\nu)$ distribution is harder |
108 |
+ |
than the $P_T(\ell\ell)$ distribution for top dilepton events. |
109 |
+ |
\item The lepton selections results in $P_T$ and $\eta$ cuts on the individual |
110 |
+ |
leptons that have no simple correspondance to the neutrino requirements. |
111 |
+ |
\item Similarly, the \met$>$50 GeV cut introduces an asymmetry between leptons and |
112 |
+ |
neutrinos which is only partially compensated by the $K$ factor above. |
113 |
+ |
\item The \met resolution is much worse than the dilepton $P_T$ resolution. |
114 |
+ |
When convoluted with a falling spectrum in the tails of \met, this result |
115 |
+ |
in a harder spectrum for \met than the original $P_T(\nu\nu)$. |
116 |
+ |
\item The \met response in CMS is not exactly 1. This causes a distortion |
117 |
+ |
in the \met distribution that is not present in the $P_T(\ell\ell)$ distribution. |
118 |
+ |
\item The $t\bar{t} \to$ dilepton signal includes contributions from |
119 |
+ |
$W \to \tau \to \ell$. For these events the arguments about the equivalence |
120 |
+ |
of $P_T(\ell\ell)$ and $P_T(\nu\nu)$ do not apply. |
121 |
+ |
\item A dilepton selection will include SM events from non $t\bar{t}$ |
122 |
+ |
sources. These events can affect the background prediction. Particularly |
123 |
+ |
dangerous are high $P_T$ Drell Yan events that barely pass the \met$>$ 50 |
124 |
+ |
GeV selection. They will tend to push the data-driven background prediction up. |
125 |
+ |
\end{itemize} |
126 |
+ |
|
127 |
+ |
We have studied these effects in SM Monte Carlo, using a mixture of generator and |
128 |
+ |
reconstruction level studies, putting the various effects in one at a time. |
129 |
+ |
For each configuration, we apply the data-driven method and report as figure |
130 |
+ |
of merit the ratio of observed and predicted events in the signal region. |
131 |
+ |
The results are summarized in Table~\ref{tab:victorybad}. |
132 |
+ |
|
133 |
+ |
\begin{table}[htb] |
134 |
+ |
\begin{center} |
135 |
+ |
\caption{\label{tab:victorybad} Test of the data driven method in Monte Carlo |
136 |
+ |
under different assumptions. See text for details.} |
137 |
+ |
\begin{tabular}{|l|c|c|c|c|c|c|c|} |
138 |
+ |
\hline |
139 |
+ |
& True $t\bar{t}$ dilepton & $t\to W\to\tau$& other SM & GEN or & Lepton $P_T$ & \met $>$ 50& obs/pred \\ |
140 |
+ |
& included & included & included & RECOSIM & and $\eta$ cuts & & \\ \hline |
141 |
+ |
1&Y & N & N & GEN & N & N & \\ |
142 |
+ |
2&Y & N & N & GEN & Y & N & \\ |
143 |
+ |
3&Y & N & N & GEN & Y & Y & \\ |
144 |
+ |
4&Y & N & N & RECOSIM & Y & Y & \\ |
145 |
+ |
5&Y & Y & N & RECOSIM & Y & Y & \\ |
146 |
+ |
6&Y & Y & Y & RECOSIM & Y & Y & \\ |
147 |
+ |
\hline |
148 |
+ |
\end{tabular} |
149 |
+ |
\end{center} |
150 |
+ |
\end{table} |
151 |
+ |
|
152 |
+ |
|
153 |
+ |
The largest discrepancy between prediction and observation occurs on the first |
154 |
+ |
line of Table~\ref{tab:victorybad}, {\em i.e.}, at the generator level with no |
155 |
+ |
cuts. We have verified that this effect is due to the polarization of |
156 |
+ |
the $W$ (we remove the polarization by reweighting the events and we get |
157 |
+ |
good agreement between prediction and observation). The kinematical |
158 |
+ |
requirements (lines 2 and 3) do not have a significant additional effect. |
159 |
+ |
Going from GEN to RECOSIM there is a significant change in observed/predicted. |
160 |
+ |
We have tracked this down to the fact that tcMET underestimates the true \met |
161 |
+ |
by $\approx 4\%$\footnote{We find that observed/predicted changes by roughly 0.1 |
162 |
+ |
for each 1.5\% change in \met response.}. Finally, contamination from non $t\bar{t}$ |
163 |
+ |
events can have a significant impact on the BG prediction. The changes between |
164 |
+ |
lines 5 and 6 of Table~\ref{tab:victorybad} is driven by only {\color{red} 3} |
165 |
+ |
Drell Yan events that pass the \met selection. |
166 |
+ |
|
167 |
+ |
An additional source of concern is that the CMS Madgraph $t\bar{t}$ MC does |
168 |
+ |
not include effects of spin correlations between the two top quarks. |
169 |
+ |
We have studied this effect at the generator level using Alpgen. We find |
170 |
+ |
that the bias is a the few percent level. |
171 |
+ |
|
172 |
+ |
Based on the results of Table~\ref{tab:victorybad}, we conclude that the |
173 |
+ |
naive data driven background estimate based on $P_T{\ell\ell)}$ needs to |
174 |
+ |
be corrected by a factor of {\color{red} $1.4 \pm 0.3$ (We need to |
175 |
+ |
decide what this number should be)}. The quoted |
176 |
+ |
uncertainty is based on the stability of the Monte Carlo tests under |
177 |
+ |
variations of event selections, choices of \met algorithm, etc. |
178 |
+ |
|
179 |
+ |
|
180 |
+ |
\subsection{Signal Contamination} |
181 |
+ |
\label{sec:sigcont} |
182 |
+ |
|
183 |
+ |
All data-driven methods are principle subject to signal contaminations |
184 |
+ |
in the control regions, and the methods described in |
185 |
+ |
Sections~\ref{sec:abcd} and~\ref{sec:victory} are not exceptions. |
186 |
+ |
Signal contamination tends to dilute the significance of a signal |
187 |
+ |
present in the data by inflating the background prediction. |
188 |
+ |
|
189 |
+ |
It is hard to quantify how important these effects are because we |
190 |
+ |
do not know what signal may be hiding in the data. Having two |
191 |
+ |
independent methods (in addition to Monte Carlo ``dead-reckoning'') |
192 |
+ |
adds redundancy because signal contamination can have different effects |
193 |
+ |
in the different control regions for the two methods. |
194 |
+ |
For example, in the extreme case of a |
195 |
+ |
new physics signal |
196 |
+ |
with $P_T(\ell \ell) = \met$, an excess of ev ents would be seen |
197 |
+ |
in the ABCD method but not in the $P_T(\ell \ell)$ method. |
198 |
+ |
|
199 |
+ |
|
200 |
+ |
The LM points are benchmarks for SUSY analyses at CMS. The effects |
201 |
+ |
of signal contaminations for a couple such points are summarized |
202 |
+ |
in Table~\ref{tab:sigcontABCD} and~\ref{tab:sigcontPT}. |
203 |
+ |
Signal contamination is definitely an important |
204 |
+ |
effect for these two LM points, but it does not totally hide the |
205 |
+ |
presence of the signal. |
206 |
|
|
207 |
|
|
208 |
+ |
\begin{table}[htb] |
209 |
+ |
\begin{center} |
210 |
+ |
\caption{\label{tab:sigcontABCD} Effects of signal contamination |
211 |
+ |
for the background predictions of the ABCD method including LM0 or |
212 |
+ |
LM1. Results |
213 |
+ |
are normalized to 30 pb$^{-1}$.} |
214 |
+ |
\begin{tabular}{|c||c|c||c|c|} |
215 |
+ |
\hline |
216 |
+ |
SM & LM0 & BG Prediction & LM1 & BG Prediction \\ |
217 |
+ |
Background & Contribution& Including LM0 & Contribution & Including LM1 \\ \hline |
218 |
+ |
x & x & x & x & x \\ |
219 |
+ |
\hline |
220 |
+ |
\end{tabular} |
221 |
+ |
\end{center} |
222 |
+ |
\end{table} |
223 |
+ |
|
224 |
+ |
\begin{table}[htb] |
225 |
+ |
\begin{center} |
226 |
+ |
\caption{\label{tab:sigcontPT} Effects of signal contamination |
227 |
+ |
for the background predictions of the $P_T(\ell\ell)$ method including LM0 or |
228 |
+ |
LM1. Results |
229 |
+ |
are normalized to 30 pb$^{-1}$.} |
230 |
+ |
\begin{tabular}{|c||c|c||c|c|} |
231 |
+ |
\hline |
232 |
+ |
SM & LM0 & BG Prediction & LM1 & BG Prediction \\ |
233 |
+ |
Background & Contribution& Including LM0 & Contribution & Including LM1 \\ \hline |
234 |
+ |
x & x & x & x & x \\ |
235 |
+ |
\hline |
236 |
+ |
\end{tabular} |
237 |
+ |
\end{center} |
238 |
+ |
\end{table} |
239 |
|
|