madgraph crashes when combining runs

Asked by Jona Ackerschott

I am running madgraph 3.5.1 with the following proc card:
```
#************************************************************
#* MadGraph5_aMC@NLO *
#* *
#* * * *
#* * * * * *
#* * * * * 5 * * * * *
#* * * * * *
#* * * *
#* *
#* *
#* VERSION 3.5.1 2023-07-11 *
#* *
#* The MadGraph5_aMC@NLO Development Team - Find us at *
#* https://server06.fynu.ucl.ac.be/projects/madgraph *
#* *
#************************************************************
#* *
#* Command File for MadGraph5_aMC@NLO *
#* *
#* run as ./bin/mg5_aMC filename *
#* *
#************************************************************
set group_subprocesses Auto
set ignore_six_quark_processes False
set low_mem_multicore_nlo_generation False
set complex_mass_scheme False
set include_lepton_initiated_processes False
set gauge unitary
set loop_optimized_output True
set loop_color_flows False
set max_npoint_for_channel 0
set default_unset_couplings 99
set max_t_for_channel 99
set zerowidth_tchannel True
set nlo_mixed_expansion True
set crash_on_error true
import model /home/users/a/ackersch/projects/better_unfolding_priors/r\
epo/data_preparation/parton/madgraph/models/higgs_characterisation_der\
ived_haa
define p = g u c d s u~ c~ d~ s~
define j = g u c d s u~ c~ d~ s~
define l- = e- mu-
define vl = ve vm vt
define vl~ = ve~ vm~ vt~
define l+ = e+ mu+
define nu = vl vl~
generate p p > t t~ x0, x0 > a a, (t > w+ b QNP=0, w+ > l+ nu QNP=0), \
(t~ > w- b~ QNP=0, w- > j j QNP=0)
output /tmp/tmpiym152vp/output

```
using a docker container defined through:
```
FROM scailfin/madgraph5-amc-nlo:mg5_amc3.5.1

USER root
WORKDIR /root

# Do an initial run of MadGraph to build the initial templates.
RUN lhapdf install NNPDF23_lo_as_0130_qed \
    && mg5_aMC \
    && rm py.py

RUN pip3 install jinja2
```
The model I am using is a slightly modified version of the Higgs Characterisation UFO model (http://feynrules.irmp.ucl.ac.be/wiki/NLOModels).

The problem is that I pretty consistently (but not always) get the following error when madgraph is attempting to combine hard-scattering generation runs (contents of run_01_main_debug.log):
```
#************************************************************
#* MadGraph5_aMC@NLO/MadEvent *
#* *
#* * * *
#* * * * * *
#* * * * * 5 * * * * *
#* * * * * *
#* * * *
#* *
#* *
#* VERSION 3.5.1 2023-07-11 *
#* *
#* The MadGraph5_aMC@NLO Development Team - Find us at *
#* https://server06.fynu.ucl.ac.be/projects/madgraph *
#* *
#************************************************************
#* *
#* Command File for MadEvent *
#* *
#* run as ./bin/madevent.py filename *
#* *
#************************************************************
generate_events run_01
Traceback (most recent call last):
  File "/usr/local/venv/MG5_aMC/madgraph/madevent/sum_html.py", line 298, in secure_float
    return float(d)
ValueError: could not convert string to float: 'end-code'

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/usr/local/venv/MG5_aMC/madgraph/interface/extended_cmd.py", line 1543, in onecmd
    return self.onecmd_orig(line, **opt)
  File "/usr/local/venv/MG5_aMC/madgraph/interface/extended_cmd.py", line 1492, in onecmd_orig
    return func(arg, **opt)
  File "/usr/local/venv/MG5_aMC/madgraph/interface/master_interface.py", line 281, in do_import
    self.cmd.do_import(self, *args, **opts)
  File "/usr/local/venv/MG5_aMC/madgraph/interface/madgraph_interface.py", line 5693, in do_import
    self.import_command_file(args[1])
  File "/usr/local/venv/MG5_aMC/madgraph/interface/extended_cmd.py", line 1691, in import_command_file
    self.exec_cmd(line, precmd=True)
  File "/usr/local/venv/MG5_aMC/madgraph/interface/extended_cmd.py", line 1572, in exec_cmd
    stop = Cmd.onecmd_orig(current_interface, line, **opt)
  File "/usr/local/venv/MG5_aMC/madgraph/interface/extended_cmd.py", line 1492, in onecmd_orig
    return func(arg, **opt)
  File "/usr/local/venv/MG5_aMC/madgraph/interface/master_interface.py", line 514, in do_launch
    return self.cmd.do_launch(self, line, *argss, **opts)
  File "/usr/local/venv/MG5_aMC/madgraph/interface/madgraph_interface.py", line 7461, in do_launch
    ext_program.run()
  File "/usr/local/venv/MG5_aMC/madgraph/interface/launch_ext_program.py", line 79, in run
    self.launch_program()
  File "/usr/local/venv/MG5_aMC/madgraph/interface/launch_ext_program.py", line 690, in launch_program
    launch.run_cmd(command)
  File "/usr/local/venv/MG5_aMC/madgraph/interface/extended_cmd.py", line 1581, in run_cmd
    return self.exec_cmd(line, errorhandling=True, precmd=True)
  File "/usr/local/venv/MG5_aMC/madgraph/interface/extended_cmd.py", line 1570, in exec_cmd
    stop = current_interface.onecmd(line, **opt)
  File "/usr/local/venv/MG5_aMC/madgraph/interface/extended_cmd.py", line 1545, in onecmd
    return self.error_handling(error, line)
  File "/usr/local/venv/MG5_aMC/madgraph/interface/extended_cmd.py", line 1521, in error_handling
    stop = self.nice_error_handling(error, line)
  File "/usr/local/venv/MG5_aMC/madgraph/interface/madevent_interface.py", line 311, in nice_error_handling
    stop = cmd.Cmd.nice_error_handling(self, error, line)
  File "/usr/local/venv/MG5_aMC/madgraph/interface/extended_cmd.py", line 1543, in onecmd
    return self.onecmd_orig(line, **opt)
  File "/usr/local/venv/MG5_aMC/madgraph/interface/extended_cmd.py", line 1492, in onecmd_orig
    return func(arg, **opt)
  File "/usr/local/venv/MG5_aMC/madgraph/interface/madevent_interface.py", line 2403, in do_generate_events
    self.run_generate_events(switch_mode, args)
  File "/usr/local/venv/MG5_aMC/madgraph/interface/common_run_interface.py", line 7671, in new_fct
    original_fct(obj, *args, **opts)
  File "/usr/local/venv/MG5_aMC/madgraph/interface/madevent_interface.py", line 2601, in run_generate_events
    self.exec_cmd('refine %s' % nb_event, postcmd=False)
  File "/usr/local/venv/MG5_aMC/madgraph/interface/extended_cmd.py", line 1572, in exec_cmd
    stop = Cmd.onecmd_orig(current_interface, line, **opt)
  File "/usr/local/venv/MG5_aMC/madgraph/interface/extended_cmd.py", line 1492, in onecmd_orig
    return func(arg, **opt)
  File "/usr/local/venv/MG5_aMC/madgraph/interface/madevent_interface.py", line 3650, in do_refine
    combine_runs.CombineRuns(self.me_dir)
  File "/usr/local/venv/MG5_aMC/madgraph/madevent/combine_runs.py", line 80, in __init__
    self.sum_multichannel(channel)
  File "/usr/local/venv/MG5_aMC/madgraph/madevent/combine_runs.py", line 102, in sum_multichannel
    results.add_results(name=alphabet[i % 26] + str((i+1)//26) ,
  File "/usr/local/venv/MG5_aMC/madgraph/madevent/sum_html.py", line 434, in add_results
    oneresult.read_results(filepath)
  File "/usr/local/venv/MG5_aMC/madgraph/madevent/sum_html.py", line 305, in read_results
    data = [secure_float(d) for d in line.split()]
  File "/usr/local/venv/MG5_aMC/madgraph/madevent/sum_html.py", line 305, in <listcomp>
    data = [secure_float(d) for d in line.split()]
  File "/usr/local/venv/MG5_aMC/madgraph/madevent/sum_html.py", line 302, in secure_float
    return float(m.group(1))*10**(float(m.group(2)))
ValueError: could not convert string to float: ''
                              Run Options
                              -----------
               stdout_level : 20 (user set)

                         MadEvent Options
                         ----------------
     automatic_html_opening : False (user set)
        notification_center : True
                   run_mode : 2
              cluster_queue : None (user set)
               cluster_time : None (user set)
               cluster_size : 100
             cluster_memory : 100 (user set)
                    nb_core : 128 (user set)
          cluster_temp_path : None

                      Configuration Options
                      ---------------------
               pythia8_path : /usr/local/venv (user set)
                  hwpp_path : None (user set)
                thepeg_path : None (user set)
                 hepmc_path : None (user set)
           madanalysis_path : None (user set)
          madanalysis5_path : None (user set)
            pythia-pgs_path : None (user set)
                    td_path : None (user set)
               delphes_path : None (user set)
        exrootanalysis_path : None (user set)
               syscalc_path : None (user set)
                 rivet_path : None
                  yoda_path : None
                     lhapdf : lhapdf-config
                 lhapdf_py2 : None
                 lhapdf_py3 : lhapdf-config (user set)
                    timeout : 60
              f2py_compiler : /usr/local/venv/bin/f2py (user set)
          f2py_compiler_py2 : None
          f2py_compiler_py3 : /usr/local/venv/bin/f2py (user set)
                web_browser : None
                 eps_viewer : /usr/bin/ghostscript (user set)
                text_editor : None
           fortran_compiler : /usr/bin/gfortran (user set)
               cpp_compiler : None
                auto_update : 7 (user set)
               cluster_type : condor
      cluster_status_update : (600, 30)
           cluster_nb_retry : 1
         cluster_local_path : None
         cluster_retry_wait : 300
#************************************************************
#* MadGraph5_aMC@NLO *
#* *
#* * * *
#* * * * * *
#* * * * * 5 * * * * *
#* * * * * *
#* * * *
#* *
#* *
#* VERSION 3.5.1 2023-07-11 *
#* *
#* The MadGraph5_aMC@NLO Development Team - Find us at *
#* https://server06.fynu.ucl.ac.be/projects/madgraph *
#* *
#************************************************************
#* *
#* Command File for MadGraph5_aMC@NLO *
#* *
#* run as ./bin/mg5_aMC filename *
#* *
#************************************************************
set group_subprocesses Auto
set ignore_six_quark_processes False
set low_mem_multicore_nlo_generation False
set complex_mass_scheme False
set include_lepton_initiated_processes False
set gauge unitary
set loop_optimized_output True
set loop_color_flows False
set max_npoint_for_channel 0
set default_unset_couplings 99
set max_t_for_channel 99
set zerowidth_tchannel True
set nlo_mixed_expansion True
set crash_on_error true
import model /home/users/a/ackersch/projects/better_unfolding_priors/r\
epo/data_preparation/parton/madgraph/models/higgs_characterisation_der\
ived_haa
define p = g u c d s u~ c~ d~ s~
define j = g u c d s u~ c~ d~ s~
define l- = e- mu-
define vl = ve vm vt
define vl~ = ve~ vm~ vt~
define l+ = e+ mu+
define nu = vl vl~
generate p p > t t~ x0, x0 > a a, (t > w+ b QNP=0, w+ > l+ nu QNP=0), \
(t~ > w- b~ QNP=0, w- > j j QNP=0)
output /tmp/tmpiym152vp/output
######################################################################
## PARAM_CARD AUTOMATICALY GENERATED BY MG5 ####
######################################################################
###################################
## INFORMATION FOR FRBLOCK
###################################
BLOCK FRBLOCK #
      1 1.000000e+03 # lambda
      2 scan1: [0.7071067811865476] # cosa
      3 1.000000e+00 # ksm
      4 scan1: [1.414213562373095] # khtt
      5 scan1: [0.0] # katt
      6 1.000000e+00 # khbb
      7 1.000000e+00 # kabb
      8 1.000000e+00 # khll
      9 1.000000e+00 # kall
      10 1.000000e+00 # khaa
      11 1.000000e+00 # kaaa
      12 1.000000e+00 # khza
      13 1.000000e+00 # kaza
      16 0.000000e+00 # khzz
      17 0.000000e+00 # kazz
      18 0.000000e+00 # khww
      19 0.000000e+00 # kaww
      20 0.000000e+00 # khda
      21 0.000000e+00 # khdz
      22 0.000000e+00 # khdwr (real part of khdw)
      23 0.000000e+00 # khdwi (imaginary part of khdw)
###################################
## INFORMATION FOR LOOP
###################################
BLOCK LOOP #
      1 9.118800e+01 # mu_r
###################################
## INFORMATION FOR SMINPUTS
###################################
BLOCK SMINPUTS #
      1 1.325070e+02 # aewm1
      2 1.166390e-05 # gf
      3 1.180000e-01 # as
###################################
## INFORMATION FOR MASS
###################################
BLOCK MASS #
      5 4.700000e+00 # mb
      6 1.730000e+02 # mt
      15 1.777000e+00 # mta
      23 9.118800e+01 # mz
      25 1.250000e+02 # mx0
      1 0.000000e+00 # d : 0.0
      2 0.000000e+00 # u : 0.0
      3 0.000000e+00 # s : 0.0
      4 0.000000e+00 # c : 0.0
      11 0.000000e+00 # e- : 0.0
      12 0.000000e+00 # ve : 0.0
      13 0.000000e+00 # mu- : 0.0
      14 0.000000e+00 # vm : 0.0
      16 0.000000e+00 # vt : 0.0
      21 0.000000e+00 # g : 0.0
      22 0.000000e+00 # a : 0.0
      24 8.041900e+01 # w+ : cmath.sqrt(mz__exp__2/2. + cmath.sqrt(mz__exp__4/4. - (aew*cmath.pi*mz__exp__2)/(gf*sqrt__2)))
      82 0.000000e+00 # gh : 0.0
###################################
## INFORMATION FOR YUKAWA
###################################
BLOCK YUKAWA #
      5 4.700000e+00 # ymb
      6 1.730000e+02 # ymt
      15 1.777000e+00 # ymtau
###################################
## INFORMATION FOR DECAY
###################################
DECAY 6 1.491500e+00 # wt
DECAY 23 2.441404e+00 # wz
DECAY 24 2.047600e+00 # ww
DECAY 25 4.070000e-03 # wx0
DECAY 1 0.000000e+00 # d : 0.0
DECAY 2 0.000000e+00 # u : 0.0
DECAY 3 0.000000e+00 # s : 0.0
DECAY 4 0.000000e+00 # c : 0.0
DECAY 5 0.000000e+00 # b : 0.0
DECAY 11 0.000000e+00 # e- : 0.0
DECAY 12 0.000000e+00 # ve : 0.0
DECAY 13 0.000000e+00 # mu- : 0.0
DECAY 14 0.000000e+00 # vm : 0.0
DECAY 15 0.000000e+00 # ta- : 0.0
DECAY 16 0.000000e+00 # vt : 0.0
DECAY 21 0.000000e+00 # g : 0.0
DECAY 22 0.000000e+00 # a : 0.0
DECAY 82 0.000000e+00 # gh : 0.0
###################################
## INFORMATION FOR QNUMBERS 82
###################################
BLOCK QNUMBERS 82 # gh
      1 0 # 3 times electric charge
      2 1 # number of spin states (2s+1)
      3 8 # colour rep (1: singlet, 3: triplet, 8: octet)
      4 1 # particle/antiparticle distinction (0=own anti)

#*********************************************************************
# MadGraph5_aMC@NLO *
# *
# run_card.dat MadEvent *
# *
# This file is used to set the parameters of the run. *
# *
# Some notation/conventions: *
# *
# Lines starting with a '# ' are info or comments *
# *
# mind the format: value = variable ! comment *
# *
# To display more options, you can type the command: *
# update to_full *
#*********************************************************************
#
#*********************************************************************
# Tag name for the run (one word) *
#*********************************************************************
  main = run_tag ! name of the run
#*********************************************************************
# Number of events and rnd seed *
# Warning: Do not generate more than 1M events in a single run *
#*********************************************************************
  100000 = nevents ! Number of unweighted events requested
  0 = iseed ! rnd seed (0=assigned automatically=default))
#*********************************************************************
# Collider type and energy *
# lpp: 0=No PDF, 1=proton, -1=antiproton, *
# 2=elastic photon of proton/ion beam *
# +/-3=PDF of electron/positron beam *
# +/-4=PDF of muon/antimuon beam *
#*********************************************************************
  1 = lpp1 ! beam 1 type
  1 = lpp2 ! beam 2 type
  6500.0 = ebeam1 ! beam 1 total energy in GeV
  6500.0 = ebeam2 ! beam 2 total energy in GeV
# To see polarised beam options: type "update beam_pol"

#*********************************************************************
# PDF CHOICE: this automatically fixes alpha_s and its evol. *
# pdlabel: lhapdf=LHAPDF (installation needed) [1412.7420] *
# iww=Improved Weizsaecker-Williams Approx.[hep-ph/9310350] *
# eva=Effective W/Z/A Approx. [2111.02442] *
# edff=EDFF in gamma-UPC [eq.(11) in 2207.03012] *
# chff=ChFF in gamma-UPC [eq.(13) in 2207.03012] *
# none=No PDF, same as lhapdf with lppx=0 *
#*********************************************************************
  nn23lo1 = pdlabel ! PDF set
  230000 = lhaid ! if pdlabel=lhapdf, this is the lhapdf number
# To see heavy ion options: type "update ion_pdf"
#*********************************************************************
# Renormalization and factorization scales *
#*********************************************************************
  False = fixed_ren_scale ! if .true. use fixed ren scale
  False = fixed_fac_scale ! if .true. use fixed fac scale
  91.188 = scale ! fixed ren scale
  91.188 = dsqrt_q2fact1 ! fixed fact scale for pdf1
  91.188 = dsqrt_q2fact2 ! fixed fact scale for pdf2
  -1 = dynamical_scale_choice ! Choose one of the preselected dynamical choices
  1.0 = scalefact ! scale factor for event-by-event scales

#*********************************************************************
# Type and output format
#*********************************************************************
  False = gridpack !True = setting up the grid pack
  -1.0 = time_of_flight ! threshold (in mm) below which the invariant livetime is not written (-1 means not written)
  average = event_norm ! average/sum. Normalization of the weight in the LHEF
# To see MLM/CKKW merging options: type "update MLM" or "update CKKW"

#*********************************************************************
#
#*********************************************************************
# Phase-Space Optimization strategy (basic options)
#*********************************************************************
  0 = nhel ! using helicities importance sampling or not.
                             ! 0: sum over helicity, 1: importance sampling
  1 = sde_strategy ! default integration strategy (hep-ph/2021.00773)
                             ! 1 is old strategy (using amp square)
        ! 2 is new strategy (using only the denominator)
# To see advanced option for Phase-Space optimization: type "update psoptim"
#*********************************************************************
# Customization (custom cuts/scale/bias/...) *
# list of files containing fortran function that overwrite default *
#*********************************************************************
   = custom_fcts ! List of files containing user hook function
#*******************************
# Parton level cuts definition *
#*******************************
  0.0 = dsqrt_shat ! minimal shat for full process
#
#
#*********************************************************************
# BW cutoff (M+/-bwcutoff*Gamma) ! Define on/off-shell for "$" and decay
#*********************************************************************
  15.0 = bwcutoff ! (M+/-bwcutoff*Gamma)
 #*********************************************************************
 # Apply pt/E/eta/dr/mij/kt_durham cuts on decay products or not
 # (note that etmiss/ptll/ptheavy/ht/sorted cuts always apply)
 #*********************************************************************
  False = cut_decays ! Cut decay products
#*********************************************************************
# Standard Cuts *
#*********************************************************************
# Minimum and maximum pt's (for max, -1 means no cut) *
#*********************************************************************
  20.0 = ptj ! minimum pt for the jets
  0.0 = ptb ! minimum pt for the b
  10.0 = pta ! minimum pt for the photons
  10.0 = ptl ! minimum pt for the charged leptons
  0.0 = misset ! minimum missing Et (sum of neutrino's momenta)
  -1.0 = ptjmax ! maximum pt for the jets
  -1.0 = ptbmax ! maximum pt for the b
  -1.0 = ptamax ! maximum pt for the photons
  -1.0 = ptlmax ! maximum pt for the charged leptons
  -1.0 = missetmax ! maximum missing Et (sum of neutrino's momenta)
  {} = pt_min_pdg ! pt cut for other particles (use pdg code). Applied on particle and anti-particle
  {} = pt_max_pdg ! pt cut for other particles (syntax e.g. {6: 100, 25: 50})
#
# For display option for energy cut in the partonic center of mass frame type 'update ecut'
#
#*********************************************************************
# Maximum and minimum absolute rapidity (for max, -1 means no cut) *
#*********************************************************************
  5.0 = etaj ! max rap for the jets
  -1.0 = etab ! max rap for the b
  2.5 = etaa ! max rap for the photons
  2.5 = etal ! max rap for the charged leptons
  0.0 = etajmin ! min rap for the jets
  0.0 = etabmin ! min rap for the b
  0.0 = etaamin ! min rap for the photons
  0.0 = etalmin ! main rap for the charged leptons
  {} = eta_min_pdg ! rap cut for other particles (use pdg code). Applied on particle and anti-particle
  {} = eta_max_pdg ! rap cut for other particles (syntax e.g. {6: 2.5, 23: 5})
#*********************************************************************
# Minimum and maximum DeltaR distance *
#*********************************************************************
  0.4 = drjj ! min distance between jets
  0.0 = drbb ! min distance between b's
  0.4 = draa ! min distance between gammas
  0.0 = drbj ! min distance between b and jet
  0.4 = draj ! min distance between gamma and jet
  0.4 = drjl ! min distance between jet and lepton
  0.0 = drab ! min distance between gamma and b
  0.0 = drbl ! min distance between b and lepton
  0.4 = dral ! min distance between gamma and lepton
  -1.0 = drjjmax ! max distance between jets
  -1.0 = drbbmax ! max distance between b's
  -1.0 = draamax ! max distance between gammas
  -1.0 = drbjmax ! max distance between b and jet
  -1.0 = drajmax ! max distance between gamma and jet
  -1.0 = drjlmax ! max distance between jet and lepton
  -1.0 = drabmax ! max distance between gamma and b
  -1.0 = drblmax ! max distance between b and lepton
  -1.0 = dralmax ! maxdistance between gamma and lepton
#*********************************************************************
# Minimum and maximum invariant mass for pairs *
#*********************************************************************
  0.0 = mmjj ! min invariant mass of a jet pair
  0.0 = mmbb ! min invariant mass of a b pair
  0.0 = mmaa ! min invariant mass of gamma gamma pair
  -1.0 = mmjjmax ! max invariant mass of a jet pair
  -1.0 = mmbbmax ! max invariant mass of a b pair
  -1.0 = mmaamax ! max invariant mass of gamma gamma pair
  {} = mxx_min_pdg ! min invariant mass of a pair of particles X/X~ (e.g. {6:250})
  {'default': False} = mxx_only_part_antipart ! if True the invariant mass is applied only
                       ! to pairs of particle/antiparticle and not to pairs of the same pdg codes.
 #*********************************************************************
 # Minimum and maximum invariant mass for all letpons *
 #*********************************************************************
  0.0 = mmnl ! min invariant mass for all letpons (l+- and vl)
  -1.0 = mmnlmax ! max invariant mass for all letpons (l+- and vl)
 #*********************************************************************
 # Minimum and maximum pt for 4-momenta sum of leptons / neutrino *
 # for pair of lepton includes only same flavor, opposite charge
 #*********************************************************************
#*********************************************************************
# Inclusive cuts *
#*********************************************************************
  0.0 = xptj ! minimum pt for at least one jet
  0.0 = xptb ! minimum pt for at least one b
  0.0 = xpta ! minimum pt for at least one photon
 #*********************************************************************
 # Control the pt's of the jets sorted by pt *
 #*********************************************************************
  0.0 = ptj1min ! minimum pt for the leading jet in pt
  0.0 = ptj2min ! minimum pt for the second jet in pt
  -1.0 = ptj1max ! maximum pt for the leading jet in pt
  -1.0 = ptj2max ! maximum pt for the second jet in pt
  0 = cutuse ! reject event if fails any (0) / all (1) jet pt cuts
 #*********************************************************************
 # Control the Ht(k)=Sum of k leading jets *
 #*********************************************************************
  0.0 = htjmin ! minimum jet HT=Sum(jet pt)
  -1.0 = htjmax ! maximum jet HT=Sum(jet pt)
  0.0 = ihtmin !inclusive Ht for all partons (including b)
  -1.0 = ihtmax !inclusive Ht for all partons (including b)
  0.0 = ht2min ! minimum Ht for the two leading jets
  0.0 = ht3min ! minimum Ht for the three leading jets
  0.0 = ht4min ! minimum Ht for the four leading jets
  -1.0 = ht2max ! maximum Ht for the two leading jets
  -1.0 = ht3max ! maximum Ht for the three leading jets
  -1.0 = ht4max ! maximum Ht for the four leading jets
 #***********************************************************************
 # Photon-isolation cuts, according to hep-ph/9801442 *
 # When ptgmin=0, all the other parameters are ignored *
 # When ptgmin>0, pta and draj are not going to be used *
 #***********************************************************************
  0.0 = ptgmin ! Min photon transverse momentum
  0.4 = r0gamma ! Radius of isolation code
  1.0 = xn ! n parameter of eq.(3.4) in hep-ph/9801442
  1.0 = epsgamma ! epsilon_gamma parameter of eq.(3.4) in hep-ph/9801442
  True = isoem ! isolate photons from EM energy (photons and leptons)
 #*********************************************************************
 # WBF cuts *
 #*********************************************************************
  0.0 = xetamin ! minimum rapidity for two jets in the WBF case
  0.0 = deltaeta ! minimum rapidity for two jets in the WBF case
#*********************************************************************
# maximal pdg code for quark to be considered as a light jet *
# (otherwise b cuts are applied) *
#*********************************************************************
  4 = maxjetflavor ! Maximum jet pdg code
#*********************************************************************
#
#*********************************************************************
# Store info for systematics studies *
# WARNING: Do not use for interference type of computation *
#*********************************************************************
  True = use_syst ! Enable systematics studies
#
  systematics = systematics_program ! none, systematics [python], SysCalc [depreceted, C++]
  ['--mur=0.5,1,2', '--muf=0.5,1,2', '--pdf=errorset'] = systematics_arguments ! see: https://cp3.irmp.ucl.ac.be/projects/madgraph/wiki/Systematics#Systematicspythonmodule
```
Note that this (so far) only happens if I generate on a larger scale on a cpu cluster, not when I test locally (both using the container).

Question information

Language:
English Edit question
Status:
Solved
For:
MadGraph5_aMC@NLO Edit question
Assignee:
No assignee Edit question
Solved by:
Olivier Mattelaer
Solved:
Last query:
Last reply:
Revision history for this message
Olivier Mattelaer (olivier-mattelaer) said :
#1

Can you check for "end-code" within your generated directory?
You should get one (or more) results.dat file which such information.
then in the same directory you should get a log file with the information why that specific channel failed.

This would be usefull to know what is going on. (because it can be hardware related --wall time/cgroup/...-- or crash due to a software reason --like a special phase-space point/...)

Cheers,

Olivier

Revision history for this message
Jona Ackerschott (jackerschott) said :
#2

Sure. I don't see anything there that looks like an error though. For example results.dat in SubProcesses/P1_gg_ttxx0_t_wpb_wp_lvl_tx_wmbx_wm_qq_x0_aa/G1.0077/ looks like this:
```
 0.00000E+00 0.00000E+00 0.00000E+00 0 0 1 0 0.000E+00 0.000E+00 0.000E+00 0.000E+00
   1 0.00000E+00 0.00000E+00 0.00000E+00 0.00000E+00 0.00000E+00
```
and the corresponding log.txt in the same directory (as well as run1_app.log) looks like this:
```
 Process in group number 1
 A PDF is used, so alpha_s(MZ) is going to be modified
 Old value of alpha_s from param_card: 0.11799999999999999
  ****************************************

       NNPDFDriver version 1.0.3
   Grid: NNPDF23_lo_as_0130_qed_mem0.grid
  ****************************************
 New value of alpha_s from PDF nn23lo1: 0.13000000000000000
 Define smin to 88.360000000000014
 Warning: pt or E min of a jet should in general be >0
 Warning: pt or E min of a jet should in general be >0
 Warning: pt or E min of a gamma should in general be >0
 Warning: pt or E min of a gamma should in general be >0
 *****************************************************
 * MadGraph/MadEvent *
 * -------------------------------- *
 * http://madgraph.hep.uiuc.edu *
 * http://madgraph.phys.ucl.ac.be *
 * http://madgraph.roma2.infn.it *
 * -------------------------------- *
 * *
 * PARAMETER AND COUPLING VALUES *
 * *
 *****************************************************

  External Params
  ---------------------------------

 MU_R = 91.188000000000002
 aEWM1 = 132.50700000000001
 mdl_Gf = 1.1663900000000000E-005
 aS = 0.11799999999999999
 mdl_ymb = 4.7000000000000002
 mdl_ymt = 173.00000000000000
 mdl_ymtau = 1.7769999999999999
 mdl_MT = 173.00000000000000
 mdl_MB = 4.7000000000000002
 mdl_MZ = 91.188000000000002
 mdl_MTA = 1.7769999999999999
 mdl_WT = 1.4915000000000000
 mdl_WZ = 2.4414039999999999
 mdl_WW = 2.0476000000000001
 mdl_MX0 = 125.00000000000000
 mdl_WX0 = 4.0699999999999998E-003
 mdl_Lambda = 1000.0000000000000
 mdl_cosa = 0.70710680000000004
 mdl_kSM = 1.0000000000000000
 mdl_kHtt = 1.4142140000000001
 mdl_kAtt = 0.0000000000000000
 mdl_kHbb = 1.0000000000000000
 mdl_kAbb = 1.0000000000000000
 mdl_kHll = 1.0000000000000000
 mdl_kAll = 1.0000000000000000
 mdl_kHza = 1.0000000000000000
 mdl_kAza = 1.0000000000000000
 mdl_kHzz = 0.0000000000000000
 mdl_kAzz = 0.0000000000000000
 mdl_kHww = 0.0000000000000000
 mdl_kAww = 0.0000000000000000
 mdl_kHda = 0.0000000000000000
 mdl_kHdz = 0.0000000000000000
 mdl_kHdwR = 0.0000000000000000
 mdl_kHdwI = 0.0000000000000000
  Internal Params
  ---------------------------------

 mdl_CKM33 = 1.0000000000000000
 mdl_conjg__CKM22 = 1.0000000000000000
 mdl_lhv = 1.0000000000000000
 mdl_conjg__CKM3x3 = 1.0000000000000000
 mdl_CKM3x3 = 1.0000000000000000
 mdl_CKM22 = 1.0000000000000000
 mdl_Ncol = 3.0000000000000000
 mdl_CA = 3.0000000000000000
 mdl_TF = 0.50000000000000000
 mdl_CF = 1.3333333333333333
 mdl_complexi = (0.0000000000000000,1.0000000000000000)
 mdl_MZ__exp__2 = 8315.2513440000002
 mdl_MZ__exp__4 = 69143404.913893804
 mdl_sqrt__2 = 1.4142135623730951
 mdl_MX0__exp__2 = 15625.000000000000
 mdl_cosa__exp__2 = 0.50000002660624010
 mdl_sina = 0.70710676237309444
 mdl_kHaa = 0.99999990590454391
 mdl_kAaa = -0.0000000000000000
 mdl_kHdw = (0.0000000000000000,0.0000000000000000)
 mdl_nb__2__exp__0_75 = 1.6817928305074290
 mdl_Ncol__exp__2 = 9.0000000000000000
 mdl_MB__exp__2 = 22.090000000000003
 mdl_MT__exp__2 = 29929.000000000000
 mdl_conjg__kHdw = (0.0000000000000000,-0.0000000000000000)
 mdl_aEW = 7.5467711139788835E-003
 mdl_MW = 80.419002445756163
 mdl_sqrt__aEW = 8.6872153846781555E-002
 mdl_ee = 0.30795376724436879
 mdl_MW__exp__2 = 6467.2159543705357
 mdl_sw2 = 0.22224648578577766
 mdl_cw = 0.88190334743339216
 mdl_sqrt__sw2 = 0.47143025548407230
 mdl_sw = 0.47143025548407230
 mdl_g1 = 0.34919219678733299
 mdl_gw = 0.65323293034757990
 mdl_vev = 246.21845810181637
 mdl_vev__exp__2 = 60623.529110035903
 mdl_lam = 0.12886910601690263
 mdl_yb = 2.6995554250465490E-002
 mdl_yt = 0.99366614581500623
 mdl_ytau = 1.0206617000654717E-002
 mdl_muH = 88.388347648318430
 mdl_AxialZUp = -0.18517701861793787
 mdl_AxialZDown = 0.18517701861793787
 mdl_VectorZUp = 7.5430507588273299E-002
 mdl_VectorZDown = -0.13030376310310560
 mdl_VectorAUp = 0.20530251149624587
 mdl_VectorADown = -0.10265125574812294
 mdl_VectorWmDxU = 0.23095271737156670
 mdl_AxialWmDxU = -0.23095271737156670
 mdl_VectorWpUxD = 0.23095271737156670
 mdl_AxialWpUxD = -0.23095271737156670
 mdl_I1x33 = (2.69955542504654901E-002,0.0000000000000000)
 mdl_I2x33 = (0.99366614581500623,0.0000000000000000)
 mdl_I3x33 = (0.99366614581500623,0.0000000000000000)
 mdl_I4x33 = (2.69955542504654901E-002,0.0000000000000000)
 mdl_Vector_tbGp = (-0.96667059156454072,0.0000000000000000)
 mdl_Axial_tbGp = (-1.0206617000654716,-0.0000000000000000)
 mdl_Vector_tbGm = (0.96667059156454072,0.0000000000000000)
 mdl_Axial_tbGm = (-1.0206617000654716,-0.0000000000000000)
 mdl_ee__exp__2 = 9.4835522759998875E-002
 mdl_gAaa = 1.3008566310666950E-005
 mdl_cw__exp__2 = 0.77775351421422245
 mdl_gAza = 4.7794971072590281E-006
 mdl_gHaa = 2.5475109025056106E-005
 mdl_gHza = 3.9182129211851395E-005
 mdl_gw__exp__2 = 0.42671326129048615
 mdl_sw__exp__2 = 0.22224648578577769
  Internal Params evaluated point by point
  ----------------------------------------

 mdl_sqrt__aS = 0.34351128074635334
 mdl_G__exp__2 = 1.4828317324943823
 mdl_G__exp__4 = 2.1987899468922913
 mdl_G__exp__3 = 1.8056676068262196
 mdl_MU_R__exp__2 = 8315.2513440000002
 mdl_gAgg = 7.6274879753643885E-005
 mdl_gHgg = -5.0849919835762592E-005
  Couplings of higgs_characterisation_derived_haa
  ---------------------------------

         GC_4 = -0.12177E+01 0.00000E+00
         GC_5 = 0.00000E+00 0.12177E+01
        GC_11 = 0.00000E+00 0.46191E+00
     GC_3001h = -0.00000E+00 -0.25475E-04
     GC_3001a = 0.00000E+00 -0.00000E+00
     GC_3010a = 0.00000E+00 0.00000E+00
     GC_3010h = -0.00000E+00 -0.70263E+00

 Collider parameters:
 --------------------

 Running at P P machine @ 13000.000000000000 GeV
 PDF set = nn23lo1
 alpha_s(Mz)= 0.1300 running at 2 loops.
 alpha_s(Mz)= 0.1300 running at 2 loops.
 Renormalization scale set on event-by-event basis
 Factorization scale set on event-by-event basis

 getting user params
Enter number of events and max and min iterations:
 Number of events and iterations 1000 5 3
Enter desired fractional accuracy:
 Desired fractional accuracy: 0.10000000000000001
Enter 0 for fixed, 2 for adjustable grid:
Suppress amplitude (0 no, 1 yes)?
 Using suppressed amplitude.
Exact helicity sum (0 yes, n = number/event)?
 Explicitly summing over helicities
Enter Configuration Number:
Running Configuration Number: 1
 Using dconfig= 77
 BW Setting 2 1 2 2 0 0 0 0
 Attempting mappinvarients 1 10
 Determine nb_t
 T-channel found: 0
 Completed mapping 10
 about to integrate 22 1000 5 3 22 1
 Using non-zero grid deformation.
 22 dimensions 1000 events 22 invarients 5 iterations 1 config(s), (0.99)
 Using h-tuple random number sequence.
 Error opening grid
 Using Uniform Grid! 40
 Using uniform alpha 1.0000000000000000
 Grid defined OK
 Masses: 0.000E+00 0.000E+00 0.000E+00 0.000E+00 0.470E+01 0.000E+00 0.000E+00 0.470E+01 0.000E+00 0.000E+00
 Set CM energy to 13000.00
 Mapping Graph 1 to config 1
 Determine nb_t
 T-channel found: 0
 Impossible BW configuration

ls status:
input_app.txt
results.dat
run1_app.log
```

Revision history for this message
Olivier Mattelaer (olivier-mattelaer) said :
#3

One of the results.dat should not be like that.
and should just have a line containing "end-code"

did you try
grep "end-code" . -rin
to find which file has it?

Cheers,

Olivier

> On 23 Apr 2024, at 08:50, Jona Ackerschott <email address hidden> wrote:
>
> Question #809269 on MadGraph5_aMC@NLO changed:
> https://answers.launchpad.net/mg5amcnlo/+question/809269
>
> Status: Answered => Open
>
> Jona Ackerschott is still having a problem:
> Sure. I don't see anything there that looks like an error though. For example results.dat in SubProcesses/P1_gg_ttxx0_t_wpb_wp_lvl_tx_wmbx_wm_qq_x0_aa/G1.0077/ looks like this:
> ```
> 0.00000E+00 0.00000E+00 0.00000E+00 0 0 1 0 0.000E+00 0.000E+00 0.000E+00 0.000E+00
> 1 0.00000E+00 0.00000E+00 0.00000E+00 0.00000E+00 0.00000E+00
> ```
> and the corresponding log.txt in the same directory (as well as run1_app.log) looks like this:
> ```
> Process in group number 1
> A PDF is used, so alpha_s(MZ) is going to be modified
> Old value of alpha_s from param_card: 0.11799999999999999
> ****************************************
>
> NNPDFDriver version 1.0.3
> Grid: NNPDF23_lo_as_0130_qed_mem0.grid
> ****************************************
> New value of alpha_s from PDF nn23lo1: 0.13000000000000000
> Define smin to 88.360000000000014
> Warning: pt or E min of a jet should in general be >0
> Warning: pt or E min of a jet should in general be >0
> Warning: pt or E min of a gamma should in general be >0
> Warning: pt or E min of a gamma should in general be >0
> *****************************************************
> * MadGraph/MadEvent *
> * -------------------------------- *
> * http://madgraph.hep.uiuc.edu *
> * http://madgraph.phys.ucl.ac.be *
> * http://madgraph.roma2.infn.it *
> * -------------------------------- *
> * *
> * PARAMETER AND COUPLING VALUES *
> * *
> *****************************************************
>
> External Params
> ---------------------------------
>
> MU_R = 91.188000000000002
> aEWM1 = 132.50700000000001
> mdl_Gf = 1.1663900000000000E-005
> aS = 0.11799999999999999
> mdl_ymb = 4.7000000000000002
> mdl_ymt = 173.00000000000000
> mdl_ymtau = 1.7769999999999999
> mdl_MT = 173.00000000000000
> mdl_MB = 4.7000000000000002
> mdl_MZ = 91.188000000000002
> mdl_MTA = 1.7769999999999999
> mdl_WT = 1.4915000000000000
> mdl_WZ = 2.4414039999999999
> mdl_WW = 2.0476000000000001
> mdl_MX0 = 125.00000000000000
> mdl_WX0 = 4.0699999999999998E-003
> mdl_Lambda = 1000.0000000000000
> mdl_cosa = 0.70710680000000004
> mdl_kSM = 1.0000000000000000
> mdl_kHtt = 1.4142140000000001
> mdl_kAtt = 0.0000000000000000
> mdl_kHbb = 1.0000000000000000
> mdl_kAbb = 1.0000000000000000
> mdl_kHll = 1.0000000000000000
> mdl_kAll = 1.0000000000000000
> mdl_kHza = 1.0000000000000000
> mdl_kAza = 1.0000000000000000
> mdl_kHzz = 0.0000000000000000
> mdl_kAzz = 0.0000000000000000
> mdl_kHww = 0.0000000000000000
> mdl_kAww = 0.0000000000000000
> mdl_kHda = 0.0000000000000000
> mdl_kHdz = 0.0000000000000000
> mdl_kHdwR = 0.0000000000000000
> mdl_kHdwI = 0.0000000000000000
> Internal Params
> ---------------------------------
>
> mdl_CKM33 = 1.0000000000000000
> mdl_conjg__CKM22 = 1.0000000000000000
> mdl_lhv = 1.0000000000000000
> mdl_conjg__CKM3x3 = 1.0000000000000000
> mdl_CKM3x3 = 1.0000000000000000
> mdl_CKM22 = 1.0000000000000000
> mdl_Ncol = 3.0000000000000000
> mdl_CA = 3.0000000000000000
> mdl_TF = 0.50000000000000000
> mdl_CF = 1.3333333333333333
> mdl_complexi = (0.0000000000000000,1.0000000000000000)
> mdl_MZ__exp__2 = 8315.2513440000002
> mdl_MZ__exp__4 = 69143404.913893804
> mdl_sqrt__2 = 1.4142135623730951
> mdl_MX0__exp__2 = 15625.000000000000
> mdl_cosa__exp__2 = 0.50000002660624010
> mdl_sina = 0.70710676237309444
> mdl_kHaa = 0.99999990590454391
> mdl_kAaa = -0.0000000000000000
> mdl_kHdw = (0.0000000000000000,0.0000000000000000)
> mdl_nb__2__exp__0_75 = 1.6817928305074290
> mdl_Ncol__exp__2 = 9.0000000000000000
> mdl_MB__exp__2 = 22.090000000000003
> mdl_MT__exp__2 = 29929.000000000000
> mdl_conjg__kHdw = (0.0000000000000000,-0.0000000000000000)
> mdl_aEW = 7.5467711139788835E-003
> mdl_MW = 80.419002445756163
> mdl_sqrt__aEW = 8.6872153846781555E-002
> mdl_ee = 0.30795376724436879
> mdl_MW__exp__2 = 6467.2159543705357
> mdl_sw2 = 0.22224648578577766
> mdl_cw = 0.88190334743339216
> mdl_sqrt__sw2 = 0.47143025548407230
> mdl_sw = 0.47143025548407230
> mdl_g1 = 0.34919219678733299
> mdl_gw = 0.65323293034757990
> mdl_vev = 246.21845810181637
> mdl_vev__exp__2 = 60623.529110035903
> mdl_lam = 0.12886910601690263
> mdl_yb = 2.6995554250465490E-002
> mdl_yt = 0.99366614581500623
> mdl_ytau = 1.0206617000654717E-002
> mdl_muH = 88.388347648318430
> mdl_AxialZUp = -0.18517701861793787
> mdl_AxialZDown = 0.18517701861793787
> mdl_VectorZUp = 7.5430507588273299E-002
> mdl_VectorZDown = -0.13030376310310560
> mdl_VectorAUp = 0.20530251149624587
> mdl_VectorADown = -0.10265125574812294
> mdl_VectorWmDxU = 0.23095271737156670
> mdl_AxialWmDxU = -0.23095271737156670
> mdl_VectorWpUxD = 0.23095271737156670
> mdl_AxialWpUxD = -0.23095271737156670
> mdl_I1x33 = (2.69955542504654901E-002,0.0000000000000000)
> mdl_I2x33 = (0.99366614581500623,0.0000000000000000)
> mdl_I3x33 = (0.99366614581500623,0.0000000000000000)
> mdl_I4x33 = (2.69955542504654901E-002,0.0000000000000000)
> mdl_Vector_tbGp = (-0.96667059156454072,0.0000000000000000)
> mdl_Axial_tbGp = (-1.0206617000654716,-0.0000000000000000)
> mdl_Vector_tbGm = (0.96667059156454072,0.0000000000000000)
> mdl_Axial_tbGm = (-1.0206617000654716,-0.0000000000000000)
> mdl_ee__exp__2 = 9.4835522759998875E-002
> mdl_gAaa = 1.3008566310666950E-005
> mdl_cw__exp__2 = 0.77775351421422245
> mdl_gAza = 4.7794971072590281E-006
> mdl_gHaa = 2.5475109025056106E-005
> mdl_gHza = 3.9182129211851395E-005
> mdl_gw__exp__2 = 0.42671326129048615
> mdl_sw__exp__2 = 0.22224648578577769
> Internal Params evaluated point by point
> ----------------------------------------
>
> mdl_sqrt__aS = 0.34351128074635334
> mdl_G__exp__2 = 1.4828317324943823
> mdl_G__exp__4 = 2.1987899468922913
> mdl_G__exp__3 = 1.8056676068262196
> mdl_MU_R__exp__2 = 8315.2513440000002
> mdl_gAgg = 7.6274879753643885E-005
> mdl_gHgg = -5.0849919835762592E-005
> Couplings of higgs_characterisation_derived_haa
> ---------------------------------
>
> GC_4 = -0.12177E+01 0.00000E+00
> GC_5 = 0.00000E+00 0.12177E+01
> GC_11 = 0.00000E+00 0.46191E+00
> GC_3001h = -0.00000E+00 -0.25475E-04
> GC_3001a = 0.00000E+00 -0.00000E+00
> GC_3010a = 0.00000E+00 0.00000E+00
> GC_3010h = -0.00000E+00 -0.70263E+00
>
> Collider parameters:
> --------------------
>
> Running at P P machine @ 13000.000000000000 GeV
> PDF set = nn23lo1
> alpha_s(Mz)= 0.1300 running at 2 loops.
> alpha_s(Mz)= 0.1300 running at 2 loops.
> Renormalization scale set on event-by-event basis
> Factorization scale set on event-by-event basis
>
>
> getting user params
> Enter number of events and max and min iterations:
> Number of events and iterations 1000 5 3
> Enter desired fractional accuracy:
> Desired fractional accuracy: 0.10000000000000001
> Enter 0 for fixed, 2 for adjustable grid:
> Suppress amplitude (0 no, 1 yes)?
> Using suppressed amplitude.
> Exact helicity sum (0 yes, n = number/event)?
> Explicitly summing over helicities
> Enter Configuration Number:
> Running Configuration Number: 1
> Using dconfig= 77
> BW Setting 2 1 2 2 0 0 0 0
> Attempting mappinvarients 1 10
> Determine nb_t
> T-channel found: 0
> Completed mapping 10
> about to integrate 22 1000 5 3 22 1
> Using non-zero grid deformation.
> 22 dimensions 1000 events 22 invarients 5 iterations 1 config(s), (0.99)
> Using h-tuple random number sequence.
> Error opening grid
> Using Uniform Grid! 40
> Using uniform alpha 1.0000000000000000
> Grid defined OK
> Masses: 0.000E+00 0.000E+00 0.000E+00 0.000E+00 0.470E+01 0.000E+00 0.000E+00 0.470E+01 0.000E+00 0.000E+00
> Set CM energy to 13000.00
> Mapping Graph 1 to config 1
> Determine nb_t
> T-channel found: 0
> Impossible BW configuration
>
> ls status:
> input_app.txt
> results.dat
> run1_app.log
> ```
>
> --
> You received this question notification because you are an answer
> contact for MadGraph5_aMC@NLO.

Revision history for this message
Jona Ackerschott (jackerschott) said (last edit ):
#4

Ah, I see, sorry. This is the output of `grep -rin 'end-code not correct [0-9]`:
```
./SubProcesses/P1_gg_ttxx0_t_wpb_wp_lvl_tx_wmbx_wm_qq_x0_aa/G1.0067b0/results.dat:1:end-code not correct 137
./SubProcesses/P1_gg_ttxx0_t_wpb_wp_lvl_tx_wmbx_wm_qq_x0_aa/G1.0067c0/results.dat:1:end-code not correct 137
./SubProcesses/P1_gg_ttxx0_t_wpb_wp_lvl_tx_wmbx_wm_qq_x0_aa/G7.0067e0/results.dat:1:end-code not correct 137
./SubProcesses/P1_gg_ttxx0_t_wpb_wp_lvl_tx_wmbx_wm_qq_x0_aa/G7.0067a0/results.dat:1:end-code not correct 137
./SubProcesses/P1_gg_ttxx0_t_wpb_wp_lvl_tx_wmbx_wm_qq_x0_aa/G7.0067h0/results.dat:1:end-code not correct 137
./SubProcesses/P1_gg_ttxx0_t_wpb_wp_lvl_tx_wmbx_wm_qq_x0_aa/G3a0/results.dat:1:end-code not correct 137
```
grepping for only `end-code` gives me a bunch of lines with `echo "end-code not correct $status_code" > results.dat`, so I'm ignoring these. `ls` on all the folder gives me
```
input_sg.txt moffset.dat results.dat
```
So no log file here.

Revision history for this message
Olivier Mattelaer (olivier-mattelaer) said :
#5

Ok,

So error 137 is a system error not related to MG5aMC but to your system.
To be precise error 137 is trigger by a "SIGKILL" message sended to the script "asking" him to stop running.
One possible reason for your system to send such message might be that your system run out of memory (or other type of ressource). I do see that you run on a multicore mode with 128 core. So I guess that you do run on either a desktop (then you need at least 128Gb of RAM which is likely) or on a cluster, in that case did you request a full node -- and the full memory of the node?--

If you are on desktop, maybe try to request less core and/or monitor your ressource to check which one you run out of. If on cluster, check the cluster report for your job and potentially ask help to your IT team to see what you are doing wrong.

Cheers,

Olivier

Revision history for this message
Jona Ackerschott (jackerschott) said :
#6

Thanks, that makes sense. I am running on a cluster and I think I just didn't consider the fact that slurm might give madgraph a node with 128 cores that it would fully use, while only having 8gb of memory. I'll try if restricting the number of cores solves this.

One other question: I tried restricting the number of cores to 1. Even when I am just generating 100 `p p > e+ e-` events with `set nb_cores 1` or `set run_mode 0` using pythia the pythia shower job takes forever (seems to be stuck) and I get the following log:
```
Counting events in input LHEF, please wait.
Found 100 events in input LHEF.

Start generating events

 PYTHIA Error: input string not found in settings databases::
    unset PYTHIA8DATA
 PYTHIA Abort from Pythia::next: not properly initialized so cannot generate events
```
When I use `set nb_cores 2` instead the generation runs without a problem.

Revision history for this message
Best Olivier Mattelaer (olivier-mattelaer) said :
#7

Hi,

This might be a bug in the pythia8 support that has been fixed in 3.5.4 version of our code.

Cheers,

Olivier

Revision history for this message
Jona Ackerschott (jackerschott) said :
#8

Thanks Olivier Mattelaer, that solved my question.