[ViewVC] Diff of: cvsroot/UserCode/benhoob/cmsnotes/StopSearch/overview.tex

Comparing UserCode/benhoob/cmsnotes/StopSearch/overview.tex (file contents):
Revision 1.5 by vimartin, Tue Oct 2 20:48:18 2012 UTC vs.
Revision 1.12 by linacre, Thu Oct 18 21:21:58 2012 UTC

#	Line 1 \| Line 1
1	<	\section{Overview and Analysis Strategy}
1	>	\section{Overview and Strategy for Background Determination}
2		\label{sec:overview}
3
4	–	[REVISE --- MAKE SURE EVERYTHING IS CORRECT]
5	–
4		We are searching for a $t\bar{t}\chi^0\chi^0$ or $W b W \bar{b} \chi^0 \chi^0$ final state
5		(after top decay in the first mode, the final states are actually the same). So to first order
6		this is ``$t\bar{t} +$ extra \met''.
7
8		We work in the $\ell +$ jets final state, where the main background is $t\bar{t}$. We look for
9		\met\ inconsistent with $W \to \ell \nu$. We do this by concentrating on the $\ell \nu$ transverse
10	<	mass ($M_T$), since except for resolution effects, $M_T < M_W$ for $W \to \ell \nu$. Thus, the
10	>	mass ($M_T$), since except for resolution and W-off-shell effects, $M_T < M_W$ for $W \to \ell \nu$. Thus, the
11		initial analysis is simply a counting experiment in the tail of the $M_T$ distribution.
12
13		The event selection is one-and-only-one high \pt\ isolated lepton, four or more jets, and
14	<	some moderate \met\ cut. At least one of the jets has to be btagged to reduce $W+$ jets.
14	>	an \met\ cut. At least one of the jets has to be b tagged to reduce $W+$ jets.
15		The event sample is then dominated by $t\bar{t}$, but there are also contributions from $W+$ jets,
16	<	single top, dibosons, etc.
16	>	single top, dibosons, as well as rare SM processes such as $ttW$.
17
18	<	In order to be sensitive to $\widetilde{t}\widetilde{t}$ production, the background in the $M_T$
19	<	tail has to be controlled at the level of 10\% or better. So this is (almost) a precision measurement.
18	>	% In order to be sensitive to $\widetilde{t}\widetilde{t}$ production, the background in the $M_T$
19	>	% tail has to be controlled at the level of 10\% or better. So this is (almost) a precision measurement.
20
21		The $t\bar{t}$ events in the $M_T$ tail can be broken up into two categories:
22	<	(i) $t\bar{t} \to \ell $+ jets and (ii) $t\bar{t} \to \ell^+ \ell^-$ where one of the two
22	>	(i) $t\bar{t} \to \ell $+ jets, and (ii) $t\bar{t} \to \ell^+ \ell^-$ where one of the two
23		leptons is not found by the second-lepton-veto (here the second lepton can be a hadronically
24		decaying $\tau$).
25	<	For a reasonable $M_T$ cut, say $M_T >$ 150 GeV, the dilepton background is of order 80\% of
25	>	For a reasonable $M_T$ cut, say $M_T >$ 150 GeV, the dilepton background is approximately 70\% of
26		the total. This is because in dileptons there are two neutrinos from $W$ decay, thus $M_T$
27		is not bounded by $M_W$. This is a very important point: while it is true that we are looking in
28		the tail of $M_T$, the bulk of the background events end up there not because of some exotic
29		\met\ reconstruction failure, but because of well understood physics processes. This means that
30	<	the background estimate can be taken from Monte Carlo (MC), after carefully accounting for possible
31	<	data/MC differences. Sophisticated fully ``data driven'' techniques are not really needed.
30	>	the background estimate can be taken from Monte Carlo (MC)
31	>	after carefully accounting for possible
32	>	data/MC differences.
33	>
34	>	The search is performed in a number of Signal Regions (SRs) defined
35	>	by minimum requirements on \met\ and $M_T$. The SRs
36	>	are defined in Section~\ref{sec:SR}.
37	>
38	>	In Section~\ref{sec:CR} we will describe the analysis of various Control Regions
39	>	(CRs) that are used to test the Monte Carlo model and, if necessary,
40	>	to extract data/MC scale factors. In this section we give a
41	>	general description of the procedure. The details of how the
42	>	final background prediction is assembled are given in Section~\ref{sec:bkg_pred}.
43	>
44
45	<	Another important point is that in order to minimize systematic uncertainties, the MC background
46	<	predictions are always normalized to the bulk of the $t\bar{t}$ data, ie, events passing all of the
45	>
46	>	% Sophisticated fully ``data driven'' techniques are not really needed.
47	>
48	>	One general point is that in order to minimize systematic uncertainties, the MC background
49	>	predictions are whenever possible normalized to the bulk of the $t\bar{t}$ data, i.e., events passing all of the
50		requirements but with $M_T \approx 80$ GeV.
51	<	This removes uncertainties
51	>	This (mostly) removes uncertainties
52		due to $\sigma(t\bar{t})$, lepton ID, trigger efficiency, luminosity, etc.
53
54	<	The $\ell +$ jets background, which is dominated by
55	<	$t\bar{t} \to \ell $+ jets, but also includes some $W +$ jets as well as single top,
56	<	is estimated as follows:
57	<	\begin{enumerate}
58	<	\item We select a control sample of events passing all cuts, but anti-btagged, i.e. b-vetoed. This
59	<	sample is now dominated by $W +$ jets. The sample is used to understand the
60	<	$M_T$ tail in $\ell +$ jets processes.
61	<	\item In MC we measure the ratio of the number of $\ell +$ jets events in the $M_T$ tail to
62	<	the number of events with $M_T \approx$ 80 GeV. This ratio turns out to be pretty much the
63	<	same for all sources of $\ell +$ jets.
64	<	\item In data we measure the same ratio but after correcting for the $t\bar{t} \to$ dilepton
65	<	contribution, as well as dibosons etc. The dilepton contribution is taken from MC after
66	<	the correction described below.
67	<	\item We compare the two ratios, as well as the shapes of the data and MC $M_T$ distributions.
68	<	If they do not agree, we try to figure out why and fix it. If they agree well enough, we define a
69	<	data-to-MC scale factor (SF) which is the ratio of the ratios defined in step 2 and 3, keeping track of the
70	<	uncertainty.
71	<	\item We next perform the full selection in $t\bar{t} \to \ell +$ jets MC, and measure this ratio
72	<	again (which should be the same as that in step 2).
60	<	\item
61	<	We perform the full selection in data. We count the number of events with $M_T \approx 80$ GeV, after subtracting off the dilepton contribution,
62	<	and multiply this count by the ratio from step 5 times the data/MC scale factor from step 4.
63	<	%We count the events with $M_T \approx 80$ GeV, we
64	<	%subtract off the dilepton contribution, we multiply the subtracted event count by the ratio from step 5 (or from
65	<	%step 2), and also by the data/MC SF from step 4.
66	<	The result is the prediction for the $\ell +$ jets BG in the $M_T$ tail.
67	<	\end{enumerate}
54	>	\subsection{$\ell +$ jets background}
55	>	\label{sec:ljbg-general}
56	>
57	>	The $\ell +$ jets background is dominated by
58	>	$t\bar{t} \to \ell $+ jets, but also includes some $W +$ jets as well as single top.
59	>	The MC input used in the background estimation
60	>	is the ratio of the number of events with $M_T$ in the signal region
61	>	to the number of events with $M_T \approx 80$ GeV.
62	>	This ratio is (possibly) corrected by a data/MC scale factor obtained
63	>	from a study of CRs, as outlined below.
64	>
65	>	Note that the ratio described above is actually different for
66	>	$t\bar{t}$/single top and $W +$ jets. This is because in $W$ events
67	>	there is a significant contribution to the $M_T$ tail from very off-shell
68	>	$W$s.
69	>	This contribution is much smaller in top events because $M(\ell \nu)$
70	>	cannot exceed $M_{top}-M_b$. Therefore the large \mt\ tail in
71	>	$t\bar{t}$/single top is dominated by jet resolution effects,
72	>	while for \wjets\ events the large \mt\ tail is dominated by off-shell W production.
73
69	–	Steps 1-4 above are all measurements on the b-vetoed samples in data and/or MC. Steps 5 and 6 are performed on the b-tagged sample.
74
75	<	To suppress dilepton backgrounds, we veto events with an isolated track of \pt $>$ 10 GeV.
75	>
76	>	For $W +$ jets the ability of the Monte Carlo to model this ratio
77	>	($R_{wjet}$) is validated in a sample of $\ell +$ jets enriched in
78	>	$W +$ jets by the application of a b-veto.
79	>	The equivalent ratio for top events ($R_{top}$) is tested in a sample of well
80	>	identified $Z \to \ell \ell$ with one lepton added to the \met\
81	>	calculation.
82	>	This sample is well suited to testing the resolution effects on
83	>	the $M_T$ tail, since off-shell effects are eliminated by the $Z$-mass
84	>	requirement.
85	>
86	>	Note that the fact that the ratios are different for
87	>	$t\bar{t}$/single top and $W +$ jets introduces a systematic
88	>	uncertainty in the background calculation because one needs
89	>	to know the relative fractions of these two components in
90	>	the $M_T \approx 80$ GeV lepton $+$ jets sample.
91	>
92	>
93	>	\subsection{Dilepton background}
94	>	\label{sec:dil-general}
95	>
96	>	To suppress dilepton backgrounds, we veto events with an isolated track of \pt\ $>$ 10 GeV (see Sec.~\ref{sec:tkveto} for details).
97		Being the common feature for electron, muon, and one-prong
98		tau decays, this veto is highly efficient for rejecting
99		$t\bar{t}$ to dilepton events. The remaining dilepton background can be classified into the following categories:
#	Line 80 \| Line 105 \| $t\bar{t}$ to dilepton events. The remai
105		%\item 1-prong hadronic tau decay
106		%\item $e$ or $\mu$ possibly from $\tau$ decay
107		%\end{itemize}
108	<	%We have currently no veto against 3-prong taus. For the other two categories, we explicitely
108	>	%We have currently no veto against 3-prong taus. For the other two categories, we explicitly
109		%veto events %with additional electrons and muons above 10 GeV , and we veto events
110		%with an isolated track of \pt\ $>$ 10 GeV. This rejects electrons and muons (either from $W\to e/\mu$ or
111		%$W\to \tau\to e/\mu$) and 1-prong tau decays.
112		%(it turns out that the explicit $e$ or $\mu$ veto is redundant with the isolated track veto).
113		%Therefore the latter two categories can be broken into
114		\begin{itemize}
115	<	\item lepton is out of acceptance $(\|\eta\| > 2.50)$
115	>	\item lepton is out of acceptance $(\|\eta\| > 2.5)$
116		\item lepton has \pt\ $<$ 10 GeV, and is inside the acceptance
117		\item lepton has \pt\ $>$ 10 GeV, is inside the acceptance, but survives the additional isolated track veto
118		\end{itemize}
#	Line 95 \| Line 120 \| $t\bar{t}$ to dilepton events. The remai
120		%Monte Carlo studies indicate that there is no dominant contribution: it is ``a little bit of this,
121		%and a little bit of that''.
122
123	<	The last category includes 3-prong tau decays as well as electrons and muons from W decay that fail the isolation requirement.
124	<	Monte Carlo studies indicate that these three components populate the $M_T$ tail in the proportions of roughly 6\%, 47\%, 47\%.
125	<	We note that at present we do not attempt to veto 3-prong tau decays as they are only 16\% of the total dilepton background according to the MC.
126	<
127	<	The high $M_T$ dilepton backgrounds come from MC, but their rate is normalized to the
128	<	$M_T \approx 80$ GeV peak. In order to perform this normalization in data, the $W +$ jets
129	<	events in the $M_T$ peak have to be subtracted off. This introduces a systematic uncertainty.
123	>	The last category includes 1-prong and 3-prong hadronic tau decays, as well as electrons and muons either from direct W decay or via W$\to\tau\to\ell$ decay
124	>	that fail the isolation requirement.
125	>	% HOOBERMAN: commenting out for now
126	>	%Monte Carlo studies indicate that these three components populate the $M_T$ tail in the proportions of roughly 6\%, 47\%, 47\%.
127	>	We note that at present we do not attempt to veto 3-prong tau decays as they are about 15\% of the total dilepton background according to the MC.
128	>
129	>	The high $M_T$ dilepton background predictions come from MC, but their rate is normalized to the
130	>	$M_T \approx 80$ GeV peak. In order to perform this normalization in
131	>	data, the rare background events in the $M_T$ peak are subtracted off. This also introduces a systematic uncertainty.
132
133		There are two types of effects that can influence the MC dilepton prediction: physics effects
134		and instrumental effects. We discuss these next, starting from physics.
135
136	<	First of all, many of our $t\bar{t}$ MC samples (eg: MadGraph) have
136	>	First of all, many of our $t\bar{t}$ MC samples (e.g. MadGraph) have
137		BR$(W \to \ell \nu)=\frac{1}{9} = 0.1111$.
138		PDG says BR$(W \to \ell \nu) = 0.1080 \pm 0.0009$. This difference matters, so the $t\bar{t}$ MC
139		must be corrected to account for this.
140
141	<	Second, our selection is $\ell +$ 4 or more jets. A dilepton event passes the selection only if there are
141	>	Second, our selection is $\ell +4$ or more jets. A dilepton event passes the selection only if there are
142		two additional jets from ISR, or one jet from ISR and one jet which is reconstructed from the
143		unidentified lepton, {\it e.g.}, a three-prong tau. Therefore, all MC dilepton $t\bar{t}$ samples used
144		in the analysis must have their jet multiplicity corrected (if necessary) to agree with what is
145		seen in $t\bar{t}$ data. We use a data control sample of well identified dilepton events with
146	<	\met\ and at least two jets as a template to ``adjust'' the $N_{jet}$ distribution of the $t\bar{t} \to$
146	>	\met\ and at least one jet (including at least one b-tag) as a template to ``adjust'' the $N_{jet}$ distribution of the $t\bar{t} \to$
147		dileptons MC samples.
148
149		The final physics effect has to do with the modeling of $t\bar{t}$ production and decay. Different
#	Line 127 \| Line 154 \| and $N_{jet}$ dependence, are compared t
154		uncertainty associated with the $t\bar{t}$ generator modeling.
155
156		The main instrumental effect is associated with the efficiency of the isolated track veto.
157	<	We use tag-and-probe to compare the isolated track veto performance in $Z + 4$ jet data and
158	<	MC, and we extract corrections if necessary. Note that the performance of the isolated track veto
157	>	We use tag-and-probe to compare the isolated track veto performance in $Z + 4$ jets data and
158	>	MC. Note that the performance of the isolated track veto
159		is not exactly the same on $e/\mu$ and on one prong hadronic tau decays. This is because
160		the pions from one-prong taus are often accompanied by $\pi^0$'s that can then result in extra
161	<	tracks due to phton conversions. We let the simulation take care of that.
161	>	tracks due to photon conversions. We let the simulation take care of that.
162		Note that JES uncertainties are effectively ``calibrated away'' by the $N_{jet}$ rescaling described above.
163
164		%Similarly, at the moment
#	Line 141 \| Line 168 \| Note that JES uncertainties are effectiv
168		%The sample of events failing the last isolated track veto is an important control sample to
169		%check that we are doing the right thing.
170
171	+	\subsection{Other backgrounds}
172	+	\label{sec:other-general}
173	+	Other backgrounds are $tW$, $ttV$, dibosons, tribosons, and Drell Yan.
174	+	These are small. They are taken from MC with appropriate scale
175	+	factors for trigger efficiency, and reweighting to match the distribution of reconstructed primary vertices in data.
176	+
177	+
178	+	\subsection{Future improvements}
179	+	\label{sec:improvements-general}
180		Finally, there are possible improvements to this basic analysis strategy that can be added in the future:
181		\begin{itemize}
182		\item Move from counting experiment to shape analysis. But first, we need to get the counting

Diff Legend

-–
+Removed lines
-+
+Added lines
-<
+Changed lines
->
+Changed lines

Comparing UserCode/benhoob/cmsnotes/StopSearch/overview.tex (file contents): Revision 1.5 by vimartin, Tue Oct 2 20:48:18 2012 UTC vs. Revision 1.12 by linacre, Thu Oct 18 21:21:58 2012 UTC

Diff Legend

Comparing UserCode/benhoob/cmsnotes/StopSearch/overview.tex (file contents):
Revision 1.5 by vimartin, Tue Oct 2 20:48:18 2012 UTC vs.
Revision 1.12 by linacre, Thu Oct 18 21:21:58 2012 UTC