Data/MC Comparison Fudge Factor and BDT Issues

Data/MC Comparison Fudge Factor and BDT Issues

(I) Preselection Cuts

(a)  Dijets from the jet trees with an associated gamma to one of the jets from the gamma trees

(b) z vertex cut -100cm<z<100cm

(c) Neutral fraction of jet matched to photon > 0.9

(d)  Photon and jet are opposite in φ [cos(φΥ – φjet)] < -0.8

(e)  PT of photon > 7GeV

(f) pT of away side jet > 5GeV

(g) Detector eta of the away side jet : ІηjetІ < 0.8

(h) Detector eta of photon 1.08<η<2.0

(i) Number of hits in each SMD plane (u,v) >3

(II) Monte Carlo Samples

QCD Background scaled to 3.146 pb-1

(a) First partonic bin weight (4-6 GeV) : 3.727

(b) Second partonic bin weight (6-9 GeV) : 3.756

(c) Third partonic bin weight (9-15 GeV) : 3.453

(d) Fourth partonic bin weight (15-35 GeV) : 2.886

Prompt photon sample scaled to 3.146 pb-1

(a) First partonic bin weight (2-3 GeV) : 0.292

(b) Second partonic bin weight (3-4 GeV) : 0.3096

(c) Third partonic bin weight (4-6 GeV) : 0.2881

(d) Fourth partonic bin weight (6-9 GeV) : 0.3175

(e) Fifth partonic bin weight (9-15 GeV) : 0.3349

(d) Fourth partonic bin weight (15-35 GeV) : 0.03013

pT Spectrum of photons after applying a fudge factor of 1.397

Total number of counts in the data = 60310

Total number of counts in the MC = 84242

MC/Data = 1.397

BDT Issues :

These plots are integrated over all pT bins and all preshower conditions.

Following 15 variables were used to model the response:

nCh, nBtow, nEtow, ptBalance, R2x1jet7, R2x1c3x3, R2x1c2x2, R2x2c3x3, nU, nV, fiveStripU/twentyfiveStripU, fiveStripV/twentyfiveStripV, ePre1, ePre2, ePost

BDT setup parameters :

NTrees=120, MaxDepth=1, AdaBoostBeta=0.70, SeparationType = GiniIndex, nCuts=50, PruneMethod=NoPruning

BDT was booked with the following options:

factory->PrepareTrainingAndTestTree( mycuts, mycutb,"SplitMode=Random:NormMode=None:!V" );

Please see the attached BDT_output.txt file to make sure that no weight renormalisation is applied!
 As you see in this file, we start with 965 signal events and 83273 background events and split the whole sample into test and training

samples randomly. So we have a total of (486+41632)42118 events for the test sample. After running the BDT we get the follwoing

BDT discriminant plots for signal and background events.

 

Integral(MVA_BDT_S) = 486                                                               Integral(MVA_BDT_B) = 42118

Few other BDT plots from MC

    

 

  

Then I run the weight file generated by this BDT on my data sample and get the following discriminant plot.