- L4 odd sentence “still at its infancy with respect to…”
We prefer to keep this sentece. The introduction has been rephrased in some points.
- It might be worth stating in the Abstract that the signal is EW-produced W(lnu) + W/Z(qq) + qq.
We find this syntax a bir cumbersome, but we have added the clarification of "plus 2 jets" in the abstract .
- Reference [12] should go when CMS is mentioned in line 23.
Moved reference
- L36: you prefer these backgrounds taken from data if simulations exist?
The text has been improved to specify that these backgrounds are estimated from simulation but with data-driven corrections.
- L40: they said in line 35 that the ttbar background is taken from data, and now they claim it is taken from MC. I guess it's because the shape comes from MC and the normalization from data, but then it could be better explained.
An explanation paragraph has been added.
- L48: is the recommendation really to apply the top pt reweighting? In case where ttbar MC is used to model the background, for BSM searches, using the official weight might be dangerous according to the official twiki.
The application of top pt reweighting in the phase space of the analysis has been studied in collaboration with the HWW group, which shares the same set datasets for their semileptonic analyses, and has been proven to be helpful in correcting the top Pt MC.
- L60: why neglect it and not include it as a systematic?
Include the interference effect as a systematic is not the most correct approach since it affects also the QCD-VV production.
Given its really small contribution we prefered to neglect it for this study.
- L92. Hasn't the HLT bandwidth increased from 2016 to 2018?
We think that it would be too much detail for this paper: we used the recommended text from the Twiki
https://twiki.cern.ch/twiki/bin/viewauth/CMS/Internal/PubDetector.
- L93-103 -> The pt and eta requirements for the AK4 jets are explicitly given but nothing is said about the AK8 jets. I would suggest adding this information.
Added the eta requirement and improved in general the section.
- Lines 104-115 -> Selection criteria on the VBS jets are mentioned here although the criteria to tag a jet as VBS are not given until the next paragraph. I would avoid making statements on objects that were not defined yet.
The order of paragraphs in Section 3 have been improved
- L108. In line 105 you wrote tag-jets and here you've written tag jets.
Fixed
- L109: why not put an upper bound on met to remove some additional backgrounds if only a limited amount is expected from the signal?
We did not consider necessary this cut, since there is not upper limit to the momentum of the neutrino in the selected signal process.
- L111. It might be necessary to write at least a sentence about the pileup.
Added
- Eq 1, a parenthesis is missing, maybe the eq would gain readability with some []
Fixed
- L116-127 -> It's not clear to me how the selection and tagging of the jets is done. In particular the question comes when there are overlapping situations like for example one jet that gives together with another jet the highest mass (so it would be VBS) but with another jet the closest to the W mass (so it would be tag-jet). How are these situations resolved? Please add this information to the text so other people can reproduce the procedure.
As the text says the pair of jets closer to mW/Z “out of the remaining jets” is selected as the Vhad jet pair.
Added “out of the remaining jets after the VBS tag jet selection” to be more explicit but it may sound redundant.
- L132: I find a bit odd that the MV are changed by just 5 and 10 GeV These intervals have been optimized to have the mass peak of the hadronically decaying vector boson centered both in the resolved and boosted categories.
They are different because of the differences between the AK8 jet mass calibration and invariant mass built with 2 AK4 jets.
- L133. Why not "to have a good reconstructed on-shell W/Z boson"?
Improved
- L140-141 -> The sentence states that "minor backgrounds" such as DY, VBF-V, VVV, Vgamma processes are estimated from MC. However, in figure 4, particularly for the boosted signal region, some of these backgrounds are absolutely dominant for the bins of greatest sensitivity. In the rest of the text, nothing or very little is said about which crosschecks have been done to validate this MC. Could you please elaborate on how you build the confidence that these MC simulations are correctly describing the data.
The only background that has a non negligible contribution in the high score bin in the boosted category is VBF-V,
that however will profit from dedicated measurements being a quite difficult background to isolate.
Given this, their estimated contribution is performed by means of the most accurate MC predictions,
and all the theoretical and experimental uncertainties have been considered.
- L151-164 -> What are the purities of W+jets and ttbar in the corresponding W+jets and ttbar control regions? Is it safe to normalize these backgrounds using these regions or there are substantial contaminations from other backgrounds.
In the W+jets CR the W+jets fraction is 55% in the resolved and 60% in the boosted category,
and the main second background is nonprompt (30%) which has a large normalization flat uncertainty included in the fit.
The top contribution in the W+jets CR is only 3% (5%) in the resolved (boosted) category.
Therefore it is safe to use that region to measure the W+jets normalization.
In the top CR instead the top quark background fraction is 90%, therefore it is quite pure to measure the top contribution.
- L155: Explain a bit more "in a differential way"
It is differential in the sense that the W+jets MC sample is corrected splitting it sub-categories using a 2D binning in two observables, as explained in the text.
- L162-163 -> I am not sure I fully understood the method of dividing the W+jets and ttbar in two regions to compare the corresponding scale factors. The fact that the numbers are the same in the two regions makes the method consistent but not necessarily correct. Could you please elaborate a bit further on this?
Firstly, only the W+jets CR has been split for this closure test of the W+jets correction.
No need to perform this test for the ttbar.
The rationale of the closure test is to cross-check that the correction factors for W+jets extracted
from the region farther from the signal region (the resonance in terms of mVhad),
can be applied in the region close to the signal, without looking directly at the signal region. Some text has been added to improve the clarity.
- Sec 5, do you train against all the backgrounds or only to the main one?
We train signal agains all backgrounds weighted correctly by their relative importance (cross-section*SF)
- L172-174 -> The SHAP method is mentioned. Even if I found the method and its use very very interesting, no conclusion, statement, or even reference is made to it in the rest of the paper. Therefore I would suggest removing the sentence about its usage, or additionally to add a sentence somewhere explaining what information was extracted from this method, or what were the benefits of using it.
We expanded the sentence about SHAP describing the 3 most important variables as identified by the technique. We have also included a
column in the Tab1 with the SHAP ranking for all the variables
- L175-176 It is very interesting to use this method, but what do you get from it?
We get a cross-check of the model dependence on the input variables and ranking of their importance.
- Table 1 caption, it would be clearer if you move the definition of the variables to the text
Fixed
- very little is said about the systematics applied on the backgrounds taken from MC. The variations of the renormalization and factorizations scales are mentioned however I am missing a single number giving some information about the size of the systematic for these MC-based background estimations. Would it be possible to add it?
All the leptons, and jets systematics applied in the fit have been described. They are applied on the background and signal taken from MC. Added more details in the text as suggested.
- L225 it would be useful if you explain at the beginning what you intend as shape or normalization effect
At the beginning of Section 6 there is already an explanation of shape and normalization effects: “In the signal extraction fit, each uncertainty is represented by a nuisance parameter that morphs the shapes of the distributions for the signal and background processes or scales their total normalization. "
- Figure 3 and others. It might be necessary to write if the uncertainty in the plots is statistical+systematic, or just statistical.
The uncertainty is the total systematic one. Legend improved.
- Figure 4 and others. It would help adding inside the plot canvas the category boosted or resolved.
The category resolved/boosted is always clear from the x-title.
- Figure 5. It cannot be easily extracted from the distributions. What's the fraction of top and W+jets in their respective control regions? What is the signal contamination in such control regions?
Top contribution in top CR is 90%. W+jets contribution in W+jets CR is 55% (60)% in the resolved (boosted) category. The signal yield in the top and W+jets CR is < 0.5% in both.
- Figure 7 clarity suggestion. Use red for expected and black for measured, including central points.
Done
- L261. The W+jets data driven estimation is a bit complex. Is it possible to add some plot / number to support the goodness of the estimation?
Plots in the W+jets CR have been included in Fig5.
Although they are not a direct handle on the W+jets data-driven estimation method since the DNN
distribution is not used at all for the W+jets MC correction factor computation.
Anyway they show a rather good estimation data/MC ratio in the control region.
Additional checks have been done during the review but we do not think that additional space should be dedicated in the paper with details about them.