Differences

This shows you the differences between two versions of the page.

--- btag:tmva [2014/12/17 14:41] – vlambert
+++ btag:tmva [2014/12/17 14:53] – vlambert
@@ Line 10: / Line 10: @@
 **2)** Make the trees really flat without vectors and set variables that are not defined for a given vertex category to a default value. For this, run your ntuples through **createNewTree.py** which will produce sets of new flat ntuples split in event range such as //CombinedSVV2NoVertex_DUSG_0_249999.root// with the shared tree name //"tree"//.
-*For the training, one can either combine these ntuples with hadd or leave them as is for the rest of the processing.
+*//For the training, one can either combine these ntuples with hadd or leave them as is for the rest of the processing. //
 **3)** Produce the category normalization weights for the training sample with **Normalization_Weights.C** and save the output to a text such as //QCD_normweights.txt//. These will be added as a weight branch "weight_norm" which flattens the vertex category distribution for the training sample.
@@ Line 24: / Line 24: @@
 -**weight_norm**    : the normalization weight from the training sample\\
 -**weight_flavour** : the ratio of the flavour prevalences in the evaluation process\\
--**weight**         : (weight_etaPtInc) x (weight_norm x weight_category) x (weight_flavour) \\
+-**weight**         : (weight_etaPtInc) x (weight_norm x weight_category) x (weight_flavour) //-- this can be used for combined trainings//\\
+The training samples are now ready for the training process with **tmva_training.py**. Make sure to create a directory called "//weights//" to save the output class and xml files from the training.
 === Evaluation Samples ===
+**1)** Make the trees really flat without vectors and set variables that are not defined for a given vertex category to a default value. For this, run your ntuples through **createNewTree.py** which will produce sets of new flat ntuples split in event range such as //CombinedSVV2NoVertex_DUSG_0_249999.root// with the shared tree name //"tree"//.
+**2)** The evaluation trees can be skimmed as well to make the evaluation process faster. The script **skimTT.py** will reference the event ranges in the file names for the flat trees and copy new skimmed trees that contain 10% (this can easily be modified) of the events from each of the flavour/category files. The output will be one combined root file for each flavour/category such as //CombinedSVV2NoVertex_DUSG.root//.
+*//Remember not to use the same skimming process for the evaluation as done for the training since one wants to keep the physical vertex category distribution for the process in the evaluation.//
+**3)** Create the vertex category weights for the training samples with **biasTTbar.C** and save the output to a text file such as //BiasDump.txt//.

Physik-Institut

CMS Wiki Pages

Differences

Physik-Institut

CMS Wiki Pages

User Tools

Site Tools

Differences

Page Tools