
Opened 11 years ago

Closed 11 years ago

#545 closed task (fixed)

Revive MLM matching

Reported by: Juergen Reuter Owned by: ALL
Priority: P0 Milestone: v2.2.0
Component: core Version: 2.1.1
Severity: blocker Keywords:
Cc: mail@…

Description

This should include the corresponding entries in the header of the LHEF format.

Change History (28)

comment:1 Changed 11 years ago by kilian

Priority: changed from P3 to P4

Requires reactivation of the Parton Shower (#44)

comment:2 Changed 11 years ago by kilian

Type: changed from defect to task

comment:3 Changed 11 years ago by Juergen Reuter

For the MLM matching with PYTHIA, there were additional tags for the MLM matching settings to be written into the header of the LHE file. This should not be forgotten.

comment:4 Changed 11 years ago by Juergen Reuter

Owner: changed from kilian to Juergen Reuter
Status: changed from new to assigned

There is no other stupid guy doing this !

comment:5 Changed 11 years ago by Juergen Reuter

Priority: changed from P4 to P0
Severity: changed from normal to major

comment:6 Changed 11 years ago by Juergen Reuter

Uff, it seems that in 2.2.0 at some point one gets stuck ... :( Investigating

comment:7 Changed 11 years ago by Juergen Reuter

Uff, it seems that in 2.2.0 at some point one gets stuck ... :( Investigating

comment:8 Changed 11 years ago by Juergen Reuter

(Sorry for the double entry). On my laptop, using the EEMatching3P.sin SINDARIN file and checkpoint = 25, the program stalls after generating 13,725 events. WTF!? Any idea how I can check where the program is stuck?

comment:9 Changed 11 years ago by Juergen Reuter

It gets stuck in the middle of an event:

<event>
 14 1  1.0000000000E+00  9.1000000000E+01 -1.0000000000E+00  1.1780000000E-01
 11 -1 0 0 0 0  0.0000000000E+00  0.0000000000E+00  4.5499999997E+01  4.5500000000E+01  5.1099700000E-04  0.0000000000E+00  9.0000000000E+00
 -11 -1 0 0 0 0  0.0000000000E+00  0.0000000000E+00 -4.5499999997E+01  4.5500000000E+01  5.1099700000E-04  0.0000000000E+00  9.0000000000E+00
 -2 1 1 2 0 501 -5.8227482585E+00  3.8356974321E+00 -7.5780912457E+00  1.0297788064E+01  0.0000000000E+00  0.0000000000E+00  9.0000000000E+00
 21 1 1 2 501 502 -8.4359925301E+00  8.3248265979E+00 -1.0714087351E+01  1.5976870020E+01  0.0000000000E+00  0.0000000000E+00  9.0000000000E+00
 21 1 1 2 502 503 -4.8752057726E+00  3.5224408401E+00 -6.0769401400E+00  8.5501124123E+00  0.0000000000E+00  0.0000000000E+00  9.0000000000E+00

comment:10 Changed 11 years ago by Juergen Reuter

Next info: on the DESY cluster it gets stuck at _exactly_ the same position! gdb tells me that it breaks in function unlink (). This should delete a file!? (Is this also used for scratch files?)

comment:11 Changed 11 years ago by Juergen Reuter

Next info: the cause was neither moving the code into the Noweb file nor the polishing. I checked with the last commit before that; it also gets stuck after 13,700 events.

comment:12 Changed 11 years ago by Juergen Reuter

I checked how often the subroutine apply_shower_particle_set is called: depending on the seed this happens a few thousand times, then everything is reset and apply_shower_particle_set is called infinitely many times, but no events are written out.

comment:13 Changed 11 years ago by Juergen Reuter

Cc: mail@… added

So what happens is that at a certain point, all events are vetoed by the mlm_matching subroutine for the reason: "not enough PS jets". So it seems that Pythia generates too few PS jets that survive the yt cut? Hm, SS, any ideas?
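To make the veto condition concrete, here is a minimal Python sketch of MLM-style jet counting. It is not the mlm_matching subroutine: the function name, the arguments and the "too many PS jets" branch are assumptions for illustration, and a real MLM procedure also matches each matrix-element parton to a jet geometrically, which is skipped here.

```python
from dataclasses import dataclass


@dataclass
class VetoResult:
    vetoed: bool
    reason: str = ""


def mlm_veto(n_me_partons, jet_scales, cut, highest_multiplicity=False):
    """Simplified jet-counting veto (hypothetical helper, not WHIZARD code).

    n_me_partons: number of final-state partons in the matrix element
    jet_scales:   one resolution value per parton-shower jet (assumed input)
    cut:          matching cut; only jets above it are counted
    """
    n_jets = sum(1 for scale in jet_scales if scale > cut)
    if n_jets < n_me_partons:
        # The case reported in this ticket.
        return VetoResult(True, "not enough PS jets")
    if n_jets > n_me_partons and not highest_multiplicity:
        # Extra jets are only tolerated in the highest-multiplicity sample.
        return VetoResult(True, "too many PS jets")
    return VetoResult(False)


if __name__ == "__main__":
    # A 2->3 event whose shower leaves only two jets above the cut is vetoed.
    print(mlm_veto(3, [0.5, 0.2, 0.01], cut=0.1))
```

If every shower history ends up in the first branch, each attempt is vetoed and no event is ever written out, which is the symptom described above.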

comment:14 Changed 11 years ago by Juergen Reuter

EEMatching4P.sin is stuck in the same way as EEMatching3P.sin is, but EEMatching5P.sin runs through (with the seed denoted). Big question now: did this ever happen in 2.1.1 as well and just went unnoticed? What is the origin of the infinite veto cascade!?

comment:15 Changed 11 years ago by Juergen Reuter

One more info: using EEMatching3P.sin with seed = 1, after call 13999, the 2->2 process is _never_ called again, and this is the very moment at which the 2->3 process starts having _all_ events rejected. WTF!?

comment:16 in reply to:  14 Changed 11 years ago by Juergen Reuter

Replying to jr_reuter:

EEMatching4P.sin is stuck in the same way as EEMatching3P.sin is, but EEMatching5P.sin runs through (with the seed denoted). Big question now: did this ever happen in 2.1.1 as well and just went unnoticed? What is the origin of the infinite veto cascade!?

Well, JRR, EEMatching5P.sin gets stuck the same way. I forgot to switch on the parton shower, so nothing at all happened. That is the reason why this one ran through. So, apparently, _all_ cases are failing, also for all kinds of seeds!

comment:17 Changed 11 years ago by Juergen Reuter

I checked the same for 2.1.1.: there you have events for the 2->2 process until the very end.

comment:18 Changed 11 years ago by Juergen Reuter

After having thought about it, I guess the logic is the other way round: because no valid event is found any longer for the 2->3 process, the 2->2 process is never called again.... I'm in need of help, utterly!

comment:19 Changed 11 years ago by Juergen Reuter

I still cannot formulate my suspicion, but maybe it has to do with how vetoed events are treated. In 2.1, the only way an event could get vetoed was by the parton shower (namely by the MLM and dummy CKKW matching routines). This was transferred into the event%is_vetoed flag. In the simulations module there was a condition in simulation_generate_event which applies only to the case that a luminosity value was given exceeding n_events, and the whole thing was wrapped into a MAX_TRIES_FOR_SINGLE_EVENT loop (still not reimplemented). The generate loop was only exited if the event was valid and not vetoed. Now, the event_transform returns a probability that is 1 if the event is valid and not vetoed, and 0 otherwise. If the probability is smaller than a random number x, the event is rejected. So either it is always accepted or always rejected, basically the same situation as in WHIZARD 2.1. This seems neither to bite nor to hurt. HELP!!!
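The accept/reject logic described above can be sketched in Python as follows. This is an illustration only, not the simulations module: the function and argument names are hypothetical, and the value of the try limit is an assumption.

```python
import random

MAX_TRIES_FOR_SINGLE_EVENT = 1000  # assumed value, for illustration only


def generate_event(make_candidate, transform_probability, rng,
                   max_tries=MAX_TRIES_FOR_SINGLE_EVENT):
    """Draw candidates until one is accepted or the try limit is hit.

    transform_probability(event) plays the role of the event transform:
    it returns 1.0 for a valid, non-vetoed event and 0.0 otherwise.
    """
    for attempt in range(1, max_tries + 1):
        event = make_candidate()
        probability = transform_probability(event)
        x = rng.random()  # uniform in [0, 1)
        # Accept only if x < probability, so 0.0 never passes and 1.0 always does.
        if x < probability:
            return event, attempt
    raise RuntimeError(
        f"no accepted event after {max_tries} tries; every candidate was vetoed"
    )


if __name__ == "__main__":
    rng = random.Random(1)
    print(generate_event(lambda: "event", lambda e: 1.0, rng))
```

Without the bounded loop, a transform that returns 0 for every candidate keeps the generator spinning forever, which matches the stall seen here; with the bound, the same situation surfaces as an error instead.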

comment:20 Changed 11 years ago by Juergen Reuter

Owner: changed from Juergen Reuter to ALL
Severity: changed from major to blocker
Status: changed from assigned to new

GIVING UP ... SOMEONE HAS TO TAKE OVER.... :(((

comment:21 Changed 11 years ago by Bijan Chokoufe Nejad

The matching with the Whizard shower runs through for EEMatching[345]W.sin and produces the desired, showered LHE files. Only EEMatching3W.sin gives an error after approx. 74000 events:

   Interactions:
    Interaction number 1
 Shower parton:
      1  [   11]        0.000E+00 0.000E+00  4.550E+01    4.55000E+01   
2.61118E-07    2.61118E-07
    0.00000E+00          ********                          I  T 1 CI   0   0
 Shower parton:
      2  [  -11]        0.000E+00 0.000E+00 -4.550E+01    4.55000E+01   
2.61118E-07    2.61118E-07
    0.00000E+00          ********                          I  T 1 CI   0   0
 Shower parton:
      3  [    2]    7   ********* 8.712E+00 -2.969E+01    3.44889E+01   
1.94451E+02    1.94451E+02
    0.00000E+00    ******** ******** ******** ******** ******** bF s  C:
12 13T 1 CI   1   0
 Shower parton:
      4  [   -2]    9   1.906E+01 3.588E+00  2.919E+01    3.75020E+01   
1.78523E+02    1.78523E+02
    0.00000E+00    ******** ******** ******** ******** ******** bF s  C:
10 11T 1 CI   0   2
 Shower parton:
      5  [   21]    7   ********* *********  5.091E-01    1.90091E+01   
4.22620E+01    4.22620E+01
    0.00000E+00    ******** ******** ******** ******** ******** bF s  C:
14 15T 1 CI   2   1


     Partons:
 Shower parton:
      1  [   11]        0.000E+00 0.000E+00  4.550E+01    4.55000E+01   
2.61118E-07    2.61118E-07
    0.00000E+00          ********                          I  T 1 CI   0   0
 Shower parton:
      2  [  -11]        0.000E+00 0.000E+00 -4.550E+01    4.55000E+01   
2.61118E-07    2.61118E-07
    0.00000E+00          ********                          I  T 1 CI   0   0
 Shower parton:
      3  [    2]    7   ********* 8.712E+00 -2.969E+01    3.44889E+01   
1.94451E+02    1.94451E+02
    0.00000E+00    ******** ******** ******** ******** ******** bF s  C:
12 13T 1 CI   1   0
 Shower parton:
      4  [   -2]    9   1.906E+01 3.588E+00  2.919E+01    3.75020E+01   
1.78523E+02    1.78523E+02
    0.00000E+00    ******** ******** ******** ******** ******** bF s  C:
10 11T 1 CI   0   2
 Shower parton:
      5  [   21]    7   ********* *********  5.091E-01    1.90091E+01   
4.22620E+01    4.22620E+01
    0.00000E+00    ******** ******** ******** ******** ******** bF s  C:
14 15T 1 CI   2   1
 Shower parton:
      7  [   94]    9   ********* ********* -2.919E+01    5.34980E+01   
1.63416E+03    1.63416E+03
    0.00000E+00    ******** ******** ******** ******** ******** bF s 
C:  3  5T 1 CI   0   0
 Shower parton:
      9  [   94]        0.000E+00 *********  3.553E-15    9.10000E+01   
8.28100E+03    8.28100E+03
    0.00000E+00    ******** ******** ******** ******** ******** bF   
C:  7  4T 1 CI   0   0
-------------------------------------------------------
 Shower parton:
     10  [   -2]    4   1.886E+01 5.828E+00  2.699E+01    3.41618E+01   
4.91923E+01    4.91923E+01
    0.00000E+00    ******** ******** ******** ******** ******** bF s  C:
16 17F 1 CI   0   6
 Shower parton:
     11  [   21]    4   2.012E-01 *********  2.199E+00    3.34023E+00   
1.26406E+00    1.26406E+00
    0.00000E+00    ******** ******** ******** ******** ******** bF s  C:
18 19F 1 CI   6   2
 Shower parton:
     12      2     3   0.000E+00 0.000E+00  0.000E+00    1.34539E+01   
1.81008E+02    2.00695E+01
    0.00000E+00                                            Ff F 1 CI   4   0
 Shower parton:
     13     21     3   0.000E+00 0.000E+00  0.000E+00    2.10350E+01   
4.42470E+02    4.07239E+01
    0.00000E+00                                            FfsF 1 CI   1   4
 Shower parton:
     14     21     5   0.000E+00 0.000E+00  0.000E+00    2.83753E+00   
8.05159E+00    4.22620E+01
    0.00000E+00                                            Ff F 1 CI   2   5
 Shower parton:
     15     21     5   0.000E+00 0.000E+00  0.000E+00    1.61716E+01   
2.61521E+02    4.22620E+01
    0.00000E+00                                            Ff F 1 CI   5   1
 Shower parton:
     16     -2    10   0.000E+00 0.000E+00  0.000E+00    2.69181E+01   
7.24586E+02    4.91923E+01
    0.00000E+00                                            Ff F 1 CI   0   7
 Shower parton:
     17     21    10   0.000E+00 0.000E+00  0.000E+00    7.24363E+00   
5.24701E+01    4.91923E+01
    0.00000E+00                                            Ff F 1 CI   7   6
 Shower parton:
     18     21    11   0.000E+00 0.000E+00  0.000E+00    7.91149E-01   
6.25916E-01    1.26406E+00
    0.00000E+00                                            Ff F 1 CI   6   8
 Shower parton:
     19     21    11   0.000E+00 0.000E+00  0.000E+00    2.54908E+00   
6.49782E+00    1.26406E+00
    0.00000E+00                                            Ff F 1 CI   8   2
    Total Momentum: 0.000E+000.000E+000.000E+00 9.100E+01
 ISR finished: T
 FSR finished: F
******************************************************************************
*** ERROR: BUG: too many loops in simulate_children_ana (?)
******************************************************************************
|             (WHIZARD run continues)

Any ideas? Could this be a configuration similar to the one Pythia gets stuck at? In the meantime I will make the error message a bit less arcane.

comment:22 Changed 11 years ago by Bijan Chokoufe Nejad

As promised, here is the polished version of that same event, which causes simulate_children_ana to return after 1000 tries. What puzzles me is that partons 12-19 have exactly zero three-momenta. It looks almost as if they haven't been set. I will try to compile with -finit-real=nan...

   Interactions: 
    Interaction number 1
 Shower parton <nr>	<type>	 <parent>	<mom(0:4)>
      1			   11	     		 4.55000E+01	 0.00000E+00	 0.00000E+00	 4.55000E+01
 <p4square>		<t>	 	<scale>		<c1>	<c2>
 2.61118E-07	 2.61118E-07		 0.00000E+00	   0	   0	
 Parton    belongs to  1
	
 Shower parton <nr>	<type>	 <parent>	<mom(0:4)>
      2			  -11	     		 4.55000E+01	 0.00000E+00	 0.00000E+00	-4.55000E+01
 <p4square>		<t>	 	<scale>		<c1>	<c2>
 2.61118E-07	 2.61118E-07		 0.00000E+00	   0	   0	
 Parton    belongs to  1
	
 Shower parton <nr>	<type>	 <parent>	<mom(0:4)>
      3			    2	    7		 3.44889E+01	-6.11496E+00	 8.71190E+00	-2.96941E+01
 <p4square>		<t>	 	<scale>		<c1>	<c2>
 1.94451E+02	 1.94451E+02		 0.00000E+00	   1	   0	
 Parton is FSR,  is simulated, has children:    12   13, belongs to  1
	
 Shower parton <nr>	<type>	 <parent>	<mom(0:4)>
      4			   -2	    9		 3.75020E+01	 1.90588E+01	 3.58781E+00	 2.91850E+01
 <p4square>		<t>	 	<scale>		<c1>	<c2>
 1.78523E+02	 1.78523E+02		 0.00000E+00	   0	   2	
 Parton is FSR,  is simulated, has children:    10   11, belongs to  1
	
 Shower parton <nr>	<type>	 <parent>	<mom(0:4)>
      5			   21	    7		 1.90091E+01	-1.29438E+01	-1.22997E+01	 5.09096E-01
 <p4square>		<t>	 	<scale>		<c1>	<c2>
 4.22620E+01	 4.22620E+01		 0.00000E+00	   2	   1	
 Parton is FSR,  is simulated, has children:    14   15, belongs to  1
	


     Partons:
 Shower parton <nr>	<type>	 <parent>	<mom(0:4)>
      1			   11	     		 4.55000E+01	 0.00000E+00	 0.00000E+00	 4.55000E+01
 <p4square>		<t>	 	<scale>		<c1>	<c2>
 2.61118E-07	 2.61118E-07		 0.00000E+00	   0	   0	
 Parton    belongs to  1
	
 Shower parton <nr>	<type>	 <parent>	<mom(0:4)>
      2			  -11	     		 4.55000E+01	 0.00000E+00	 0.00000E+00	-4.55000E+01
 <p4square>		<t>	 	<scale>		<c1>	<c2>
 2.61118E-07	 2.61118E-07		 0.00000E+00	   0	   0	
 Parton    belongs to  1
	
 Shower parton <nr>	<type>	 <parent>	<mom(0:4)>
      3			    2	    7		 3.44889E+01	-6.11496E+00	 8.71190E+00	-2.96941E+01
 <p4square>		<t>	 	<scale>		<c1>	<c2>
 1.94451E+02	 1.94451E+02		 0.00000E+00	   1	   0	
 Parton is FSR,  is simulated, has children:    12   13, belongs to  1
	
 Shower parton <nr>	<type>	 <parent>	<mom(0:4)>
      4			   -2	    9		 3.75020E+01	 1.90588E+01	 3.58781E+00	 2.91850E+01
 <p4square>		<t>	 	<scale>		<c1>	<c2>
 1.78523E+02	 1.78523E+02		 0.00000E+00	   0	   2	
 Parton is FSR,  is simulated, has children:    10   11, belongs to  1
	
 Shower parton <nr>	<type>	 <parent>	<mom(0:4)>
      5			   21	    7		 1.90091E+01	-1.29438E+01	-1.22997E+01	 5.09096E-01
 <p4square>		<t>	 	<scale>		<c1>	<c2>
 4.22620E+01	 4.22620E+01		 0.00000E+00	   2	   1	
 Parton is FSR,  is simulated, has children:    14   15, belongs to  1
	
 Shower parton <nr>	<type>	 <parent>	<mom(0:4)>
      7			   94	    9		 5.34980E+01	-1.90588E+01	-3.58781E+00	-2.91850E+01
 <p4square>		<t>	 	<scale>		<c1>	<c2>
 1.63416E+03	 1.63416E+03		 0.00000E+00	   0	   0	
 Parton is FSR,  is simulated, has children:     3    5, belongs to  1
	
 Shower parton <nr>	<type>	 <parent>	<mom(0:4)>
      9			   94	     		 9.10000E+01	 0.00000E+00	-8.88178E-16	 3.55271E-15
 <p4square>		<t>	 	<scale>		<c1>	<c2>
 8.28100E+03	 8.28100E+03		 0.00000E+00	   0	   0	
 Parton is FSR,   has children:     7    4, belongs to  1
	
-------------------------------------------------------
 Shower parton <nr>	<type>	 <parent>	<mom(0:4)>
     10			   -2	    4		 3.41618E+01	 1.88576E+01	 5.82793E+00	 2.69863E+01
 <p4square>		<t>	 	<scale>		<c1>	<c2>
 4.91923E+01	 4.91923E+01		 0.00000E+00	   0	   6	
 Parton is FSR,  is simulated, has children:    16   17, doesnt belong to  1
	
 Shower parton <nr>	<type>	 <parent>	<mom(0:4)>
     11			   21	    4		 3.34023E+00	 2.01219E-01	-2.24012E+00	 2.19873E+00
 <p4square>		<t>	 	<scale>		<c1>	<c2>
 1.26406E+00	 1.26406E+00		 0.00000E+00	   6	   2	
 Parton is FSR,  is simulated, has children:    18   19, doesnt belong to  1
	
 Shower parton <nr>	<type>	 <parent>	<mom(0:4)>
     12			    2	    3		 1.34539E+01	 0.00000E+00	 0.00000E+00	 0.00000E+00
 <p4square>		<t>	 	<scale>		<c1>	<c2>
 1.81008E+02	 2.00695E+01		 0.00000E+00	   4	   0	
 Parton is FSR, is final,  doesnt belong to  1
	
 Shower parton <nr>	<type>	 <parent>	<mom(0:4)>
     13			   21	    3		 2.10350E+01	 0.00000E+00	 0.00000E+00	 0.00000E+00
 <p4square>		<t>	 	<scale>		<c1>	<c2>
 4.42470E+02	 4.07239E+01		 0.00000E+00	   1	   4	
 Parton is FSR, is final, is simulated, doesnt belong to  1
	
 Shower parton <nr>	<type>	 <parent>	<mom(0:4)>
     14			   21	    5		 2.83753E+00	 0.00000E+00	 0.00000E+00	 0.00000E+00
 <p4square>		<t>	 	<scale>		<c1>	<c2>
 8.05159E+00	 4.22620E+01		 0.00000E+00	   2	   5	
 Parton is FSR, is final,  doesnt belong to  1
	
 Shower parton <nr>	<type>	 <parent>	<mom(0:4)>
     15			   21	    5		 1.61716E+01	 0.00000E+00	 0.00000E+00	 0.00000E+00
 <p4square>		<t>	 	<scale>		<c1>	<c2>
 2.61521E+02	 4.22620E+01		 0.00000E+00	   5	   1	
 Parton is FSR, is final,  doesnt belong to  1
	
 Shower parton <nr>	<type>	 <parent>	<mom(0:4)>
     16			   -2	   10		 2.69181E+01	 0.00000E+00	 0.00000E+00	 0.00000E+00
 <p4square>		<t>	 	<scale>		<c1>	<c2>
 7.24586E+02	 4.91923E+01		 0.00000E+00	   0	   7	
 Parton is FSR, is final,  doesnt belong to  1
	
 Shower parton <nr>	<type>	 <parent>	<mom(0:4)>
     17			   21	   10		 7.24363E+00	 0.00000E+00	 0.00000E+00	 0.00000E+00
 <p4square>		<t>	 	<scale>		<c1>	<c2>
 5.24701E+01	 4.91923E+01		 0.00000E+00	   7	   6	
 Parton is FSR, is final,  doesnt belong to  1
	
 Shower parton <nr>	<type>	 <parent>	<mom(0:4)>
     18			   21	   11		 7.91149E-01	 0.00000E+00	 0.00000E+00	 0.00000E+00
 <p4square>		<t>	 	<scale>		<c1>	<c2>
 6.25916E-01	 1.26406E+00		 0.00000E+00	   6	   8	
 Parton is FSR, is final,  doesnt belong to  1
	
 Shower parton <nr>	<type>	 <parent>	<mom(0:4)>
     19			   21	   11		 2.54908E+00	 0.00000E+00	 0.00000E+00	 0.00000E+00
 <p4square>		<t>	 	<scale>		<c1>	<c2>
 6.49782E+00	 1.26406E+00		 0.00000E+00	   8	   2	
 Parton is FSR, is final,  doesnt belong to  1
	
    Total Momentum [0:4]:  9.10000E+01	 0.00000E+00	 0.00000E+00	 0.00000E+00	
 ISR finished: T
 FSR finished: F
******************************************************************************
*** ERROR: BUG: too many loops in simulate_children_ana (?)
******************************************************************************
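A quick cross-check of the dump above, as a Python sketch. The numbers are copied from partons 12-19 and from the energies of their parents (3, 5, 10, 11); the tuple layout, the helper names and the tolerance are assumptions made for this illustration.

```python
import math

# (nr, parent, E, px, py, pz, p4square), copied from the dump above
FINAL_PARTONS = [
    (12, 3, 13.4539, 0.0, 0.0, 0.0, 181.008),
    (13, 3, 21.0350, 0.0, 0.0, 0.0, 442.470),
    (14, 5, 2.83753, 0.0, 0.0, 0.0, 8.05159),
    (15, 5, 16.1716, 0.0, 0.0, 0.0, 261.521),
    (16, 10, 26.9181, 0.0, 0.0, 0.0, 724.586),
    (17, 10, 7.24363, 0.0, 0.0, 0.0, 52.4701),
    (18, 11, 0.791149, 0.0, 0.0, 0.0, 0.625916),
    (19, 11, 2.54908, 0.0, 0.0, 0.0, 6.49782),
]

# Energies of the parent partons, also taken from the dump
PARENT_ENERGY = {3: 34.4889, 5: 19.0091, 10: 34.1618, 11: 3.34023}

REL_TOL = 1e-4  # loose, because the dump prints only six significant digits


def has_zero_three_momentum(parton):
    _, _, _, px, py, pz, _ = parton
    return px == 0.0 and py == 0.0 and pz == 0.0


def p4square_equals_energy_squared(parton):
    _, _, energy, _, _, _, p4square = parton
    return math.isclose(energy * energy, p4square, rel_tol=REL_TOL)


def children_energy_matches_parent(parent_nr):
    total = sum(p[2] for p in FINAL_PARTONS if p[1] == parent_nr)
    return math.isclose(total, PARENT_ENERGY[parent_nr], rel_tol=REL_TOL)


if __name__ == "__main__":
    for parton in FINAL_PARTONS:
        print(parton[0], has_zero_three_momentum(parton),
              p4square_equals_energy_squared(parton))
    for parent_nr in PARENT_ENERGY:
        print(parent_nr, children_energy_matches_parent(parent_nr))
```

For these numbers all three checks hold: the children's energies add up to the parent energy and p4square equals E^2, while the spatial components are still zero. That would fit the picture of an energy split that was done without the three-momenta being assigned, though it does not prove it.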

comment:23 Changed 11 years ago by Bijan Chokoufe Nejad

Program received signal SIGFPE: Floating-point exception - erroneous arithmetic operation. It comes in at exactly this event. Backtracing the exception now :/

comment:24 Changed 11 years ago by sschmidt

The "too many loops in simulate_children_ana" issue has nothing to do with MLM matching. It occurs if, for a splitting in the WHIZARD PS, the splitting of the children cannot be found, probably due to momentum conservation. It shouldn't occur, and in my tests it never did...

I can look into the MLM problems, but it'll be at least a week until I have enough time.

comment:25 in reply to:  24 Changed 11 years ago by Bijan Chokoufe Nejad

Replying to sschmidt:

The "too many loops in simulate_children_ana" issue has nothing to do with MLM matching. It occurs if, for a splitting in the WHIZARD PS, the splitting of the children cannot be found, probably due to momentum conservation. It shouldn't occur, and in my tests it never did...

I can look into the MLM problems, but it'll be at least a week until I have enough time.

Indeed. Maybe it just didn't show up earlier due to lucky seeds. It happens in 2 out of 10 seeds. I have increased the numerical safety net you have for cstep and run it now with 50 seeds to make sure that the Whizard shower is completely fine now.

comment:26 Changed 11 years ago by Juergen Reuter

Event_transforms should have an option to veto. (WK promised that)

JRR and BCN will rewrite the SINDARIN files.

comment:27 Changed 11 years ago by Juergen Reuter

As of r5623, all Pythia examples for e+ e- are working, the WHIZARD shower examples in principle, too, but there is the infinite loop in shower_simulate_children_ana that occurs in 5-6% of the cases. Still not understood. Drell-Yan examples under investigation.

comment:28 Changed 11 years ago by Juergen Reuter

Resolution: fixed
Status: changed from new to closed

We are roughly back to the status of 2.1. An improvement is the setup for writing and reading scratch LHE events for transfer to and from Pythia. Some hiccups of the form `Failed to read LHEF event information. Will assume end of file has been reached.` no longer occur because of this improvement. All matching setups are running and only very rarely run into the problems now split off into #632. There are now also some poor man's tests for FSR and ISR matching (formerly fsr_shower and isr_shower). Closing this one.
