beware possible double spaces in pycfg_params: https://savannah.cern.ch/bugs/?102552
make sure PSETHASH is always defined: https://savannah.cern.ch/bugs/?95570
make sure sites from SiteDB are strings, not unicode: https://savannah.cern.ch/bugs/?102436
do not allow to exceed 100MB ISB, users with a real neeed now have Crab3: https://savannah.cern.ch/bugs/index.php?102210
max_wall_clock_time shoudl work for any scheduler https://savannah.cern.ch/bugs/index.php?101064
100MB ISB for remoteGlidein, https://savannah.cern.ch/bugs/?100206
fix for https://savannah.cern.ch/bugs/index.php?99758
enforce umask permission when untarring sandbox: https://savannah.cern.ch/bugs/index.php?99592
limit w permission in wrapper, see https://savannah.cern.ch/bugs/index.php?99554
propagate to watchdog and JDL the max_wall_clock from crab.cfg for remoteGlidein, see https://savannah.cern.ch/bugs/index.php?99008
allow to set RSS limit via crab.cfg for remoteGlidein see https://savannah.cern.ch/bugs/index.php?97656
max 5000 job on crab server https://savannah.cern.ch/bugs/?98597
allow user to override data location, see https://savannah.cern.ch/bugs/index.php?97801
allow 5K job/task for remoteGlidein
better messages
fixed savannah bug 94491
zip Watchdog log https://savannah.cern.ch/bugs/?95059
add diagnostic printouts when looking for /cvmfs
add crabWatchdog.sh to ISB, allow for CVMFS on OSG: https://savannah.cern.ch/bugs/index.php?93391 and https://savannah.cern.ch/bugs/?94220
last fix was missing self. :-(
proper check for dataseth path format, fix for https://savannah.cern.ch/bugs/?67885
make sure proxy protection is not overridden, fix for https://savannah.cern.ch/bugs/?87233
print the content of the software dir
rollback for savannah 85243
try to remove the crab working_dir if the task has problem during creation step, savannah bug 85243
changed file permission in a+w for savannah bug #78585
rollback #82422
remove "data" dir check
fixes for additional infos reporting to dashboard
more info to dashboard
fix for scrip_exe. Set only script name and not full path
changed url for CMSSW available releases download, savannah bug 79277
modify datasetpath check, savannah bug 77899
Ignore release type, patch from Belforte
ISB limit fixed to 100MBcms_cmssw.py
add ISB size check with server
fixed for old executable_exit_status value, savannah bug 73035
Restrict characters in output files
Inadvertent checkin of cms_cmssw.py
Cleaner logic on retriveValue and throw exception for split by run with no runselection
for multiout
Enforce split by lumi for data, enable split by lumi with runselection but no lumi mask
merge 2.7.1_branch
fix for bug 62338
fixed logger message
Fix for bug #65439: updated URL for Server_available_for_users
removed some print
add LumiMask support
Use binary pickle file, put in ISB
Enable split by lumi for any dataset
changes for json file
Add check on CMSSW releases from https://cmstags.cern.ch/cgi-bin/CmsTC/ReleasesXML
merging LumiMask..
\2\7\1\ \f\i\x\e\s\ \f\r\o\m\ \H\E\A\D
deprecated glite_slc5 and glitecoll
Code to allow lumi-filtering (pseudo-ADS)
Revert accidental changes
Note on CAF's in Submitter, clarify
Always use listFiles for ADS, change info message for ADS
Branch for DBSMART, probably never will be used
cfg_params is in lower letter for skip_tfileservice_output
Fix for unique output name when return_data=1 is used
Set SCRAM_ARCH on all middlewares
Fix for bug #58043: now stageout uses unique names for the files sent to SE
support for glite_slc5 scheduler parameter to enable GLITE CLI usage
changed for backup copy to close SE. adding temp in the path dir and wrapper exit code 60308
typo for pbs
add support for WMBS
alignment with head
fill ouptputBaseName field
adding SchedulerPbs
Change first_run: defaults to 1 for MC, integer increment, not string, and not used in cms_cmssw.py
added a check to validate the syntax of the input dataset name, savannah bug 55539
added a check to validate the syntax of the input dataset name, savannah bug 55539
changes related CopyData, savannah task 8700
changes related CopyData, savannah task 8700
update twiki links
Changed link for server available to users
Code and documentation for passing parameters to pset.py
Drop support for CMSSW 1.x, 2.0 (.cfg language and old-style random #s)
Limit job creation for stand-alone, non-local schedulers
Move from limiting # of created jobs to # of submitted jobs
fix for multiple args
Protect case where dataset=None
Don't lose message, detect ADS properly
changes related publication name too long, savannah task 10664
Add config parameters for lumi-based splitting
Detect ADS, not user set, and fetch lumis for ADS
Crashing bug in 261 avoidable by the user
type, self.CMSSW_major, self.CMSSW_minor, self.CMSSW_patch = tuple(self.version.split(_) cannot works for pre release & special tag
added file_list as argument of ModifyJobReport, savannah task #10430
typo
Solve case where multiple output modules present and case where modules present and not in endpath (Nicola)
fix for bug #52484. Add arguments.xml to the ISB files list. not more included in default.tgz
raise exception for more than 500 jobs and not server
added the check of length of datasetname to publish
remove default.tar after zipping
changes to allows use of CREAM ce by setting GRID.use_cream = 1
changes for script_exe support
add support for script_arguments (to be used with script_exe). Fixed problem using CMSSW pre-release
Restore generators other than pythia, add lhe
add CMSSW.ignore_edm_output to allow production of EDM output w/o saving it anywhere
print CMS names for destination SE rather than plain grid names
Introduce a CrabLogger class which hanlde the std python logging
add crab.cfg to default.tgz. strongly requested by David Braun
improved logging
mesg from info to debug
adapting code to EDG --> GRID migration
typos. added exception
adapting code to logging usage. (crab_logger removed)
fix problem in defining psetname
movig the call to edmConfigHash _after_ the untarring of user code bug #50798
Warning if skipBadFiles is True
re-enabled -extend functionality. Revisited default.tgz preparation
typo
WMCore/__init__.py needed in the ISB
add control to see if a PoolOutputModule is present but the corresponding file is not in the output_files list. It would just create a, possibly, huge file on remote WN with no purpose, but stress the WN itself
removed not needed code. Temporary commented check processed files. Cannot works given the arguments changes
Use pickled config file and problems with triple-quote commenting.
argsList must be always 1... to be improved
Write xml with job arguments. Changes needed to solve problem: #39187
add SEName, LFN and PFN to the FJR in the case of copy without publication for edm and not edm files
minor cleanup
prepare correctly cms environment also for lsc/caf
Add some ARC specific bits to the jobs scripts.
iniatialize variables
indentation
ModifyJObReport call only for data publication, for the copy we have to consider also no edm file
fixed old debug_wrapper true
remove debug print out
Fix broken (during SGE patches) WN script on OSG.
sge schedeuler specific support
changed call to ModifyJobreport to write in the fjr the LFN and SE name even in case of copy without publication
More clear messages
exeExitCode = 30001 does not block the copy of output to SE
diff -q --> diff -qbB as suggested by StefanoB
export SOFTWARE_DIR and RUNTIME_AREA variables.
Remove support for CMSSW < 1.5, old method of adding FJR
write CMSSW version to log
fixed bug with exe_exit_status
moved the check of exe_exit_status value in order to avoid the copy of output to SE in the case of executable failure
set correctly maxEvent for mc prod jobs
adapt to Splitter changes
preserver order of blocks
code reorganization. Added support for Run Based splitting.
added warning if no output files are defined nor in the crab.cfg neither in the cmssw configuration
minor change in message related to not hosted data
fix for no_block_boundary configurable
Force skipEvent to be zero in CMSSW.py in case user has set it
Raise exception if no location is found for any blocks
reduce redundant debug and protect against splitting 0 jobs
Add support for generator=comphep
Off by one for MadGraph
Do it right for MadGraph
Do it right for MadGraph
Add support for generators like madgraph
added first part of code to support remote dir check at creation time. (not finished)
dereference link for tar-ball also for lib/ and module/
print tar-ball content on WN only if debug_wrapper is set
Don't pass cfg_params to SiteScreening
only additional_input_files are dereferenced while building the ISB tarball
dereference link into ISB tarball
set wrapper_exit_code before call exit_func also when cmsRun exit with code !=0
Transition to WMCore SiteScreening and SiteDB
no isb limit using caf/lsf
added ProcessedDataset for publication
fixed some typos
added abs path to $file_list
changed sintax for cmscp
include ProdCommon/Storage within the ISB
scriptName filed not more needed in BL DB. Adapted CRAB code
fix for ui_working_dir absolute path
Logging tarring error exception message
typo
Removed dependence from LFNBase (reduce n° of SiteDB query). Added cmscp to ISB. Improved msg ISB size related adding reference to "how to use server" twiki. Changed modifyReport: it must start with "ws" following the related convention. minor code cleaning
*** empty log message ***
add prototype for jobsplitting without block boundaries constraint. Disabled by default (enabled only if CMSSW.noblockboundary=1)
added protection if datasetpath=""
removed uncommented wrong variable
ISB limit automatically removed while using the server. Improved a bit the error message to limit (hopefully) HN formum spamming
missing parenthesis
wrapper exit if cmsRun fails. bug #39675
fix for problem on 3001 running on both parents and child (https://hypernews.cern.ch/HyperNews/CMS/get/crabDevelopment/837/1/1/1.html)
changes related the new LFN and new datasetpath to publish (task 7471)
Restore the glite requirement to run only on Production CE
Allow patterns like T1,T3
Raise exception if old seed manipulation is attempted
added changes for PU_dataset and the LFN in the case of publication for CAF and LSF
Fix for edmConfigHash with CMSSW 2.1 on CRAB 2.3.1 branch
Fix for edmConfigHash with CMSSW 2.1
move rfcp function from SchedulerLocal to cmscp in crab_template and added some changes for publication using CAF and LSF
Fix typo
Fix on a typo
some changes according to the head of CRAB (24/06/2008)
dataset_pu moved from USER to CMSSW section
missed $... $Njob is a variable
removed the control of input file in the case of publication with PU
add IOError to exception list while creating the tar-ball
allow automatical addition of EDM output file produced via PoolOutputModule to the list of output_files.
enable hierarchical crab.cfg
fixed bug about publication
Correct print when output file is not in crab.cfg/TFileService
automatically add file produced via TFileService to list of output files if present
fix for correct tar ball creation with needd ProdCommon stuff (and not all)
also CMSConfigTools is needed
fix for data
dif for "data" dir discovery and for double ProdCommon
fixed tarbal preparation bugs
restored symbolic link...
added support to run on dataset with new blocks , skipping the already analyzed. Plus some minor cleaning
move func fileNumber from cms_cmssw to crab_util
improvement in dir walking to find data/ directory, thanks to Brian bbockelm@cse.unl.edu
remove the double ProdCommon dirname
fully added support for two tier inputs
just renamed JobReportErrorCode.py to fillCrabFjr.py, a minor cleaning on crab_template
syntax
check for tot_nevent = -1
fix for not processed files
moved some ls -Al to high level debug
Don't make broken links. Still needs full testing.
moved from debuggin_pset to more general debugging_wrapper. Plus major changes on parse framework job report script. Now it take arguments and return different output according to the requests.
Switch to single quotes
more check on executable_exit_status from FJR
Redo escapes
added protection for n_tot_event < n_job
Escape python to more or less match original .sh
fix for additonal input file
indentation
Improved the fjr parsing... now the related wrapper code has been written by cms_cmssw. It allow to fix the problem due to the patch for missing input files
Printing CMSSW Pset in STDOUT is now optional. Specify USER.pset_debug=True to enable the print. Updated related documentation
removed some confusing comments
must handle also CMSSW_X_Y_Z/src dir...
Adapt again to new random # rules in CMSSW_2_1_x
Finalize support for 2_1_x and supplying python config file in crab.cfg
changed variable major --> self.major_version
Remove PsetManip150
bug fix
chatched error with eval scramv1
now the input sanbox is just one tarball plus the wrapper script
reimplemented dashboard communication sending again both pre and post submssion infos, plus many Submitter code reorganization
fix bug in splitting, in case remaining event per block are less than events per job
Many changes to have LSF working with BossLite Introduce Killer class to handle -kill which works again Work_space::res() return the correct output directory also in case user has set a non default one, Likewise for logDir() USER.outputdir is not to be used anywhere outside workspace class Some cleanup in submit logic, to reduce call of Scheduler specific classes from Submitter.py crab -clean works as well (well, almost, still need to remove twice the directory) Fill startDirectory and outputDirectory to Task GetOuput check status and not schedulerStatus (not stantard) Some cleanup in the use of BlackWhiteListParser No explicit check of scheduler concrete type in Submitter at listMatch level: move different behaviour in SchedulerXYZ implementation Plus other things I'm forgetting... Stefano
re-added the copy_problem LFN for data publication
fixed problem with previous commit
fix for data publication
cope with RELEASETOP found in different areas od .SCRAM
cope with RELEASETOP found in different areas od .SCRAM
Include changes from 2_1_1 to 2_1_2 in head
Fixes for CMSSW on branch
Fix problems with mv
restored output functionality. Some fix on arguments format avoiding wrapper crash. Implemented tarball creation for output sanbox... still remaining to enable the check on output sandbox size...and the related backup solution
the executable in jdl must be without absolute path. The wrapper must be included within the inifiles
still another fix on the job index plus bosslite integration optimization.
fix publication
patch for publication (store directory in PFN not compulsory)
changes to write the FJR from beginning of job wrapper
Many changes to integrate BossLite. Creation step fully implemented and optimized... Submission is now working again. Here the missing things are the support for jobs submission by range, the message sending to ML, and the listmatch_match check. Actually the requirements can be changed on the fly as was in the past. The status is fully working with BossLite. The exit code display is not there since the new boss does not implement the RealTime mon.. here the functionality is under development by Federica: to be integrated.
code number for OSG WORKING_DIR could not be deleted on WN is now 10017
re-added some changes for clean of wrapper
removed jobDB and TaskDB usage. now works with sqlite
first changes about wrapper
Python config file, remove reliance on sed
placeholders for CMSSW_2_0_0 pycfg manipulation
add requirement for CE in Production state for glitecoll scheduler
adjustements in check OSB size. std* where double counted in case of multiple outputs
Consistent case
more verbose message if white list is used and no site can be selected, plus similar update in documentation
Redesign the inheritance three of Scheduler Classes, and remove the SchedulerBoss one Introduce ScheuledGrid and ScheduleLocal as base classes for Grid and local scheduler, respectively. All interaction with boss are done via Boss class, whose (unique) instance is owned by Scheduler These changes are done to reduce code duplication. Plus other minor modification and cosmetcs, as usual
remove all use of try: catch: and replace with if
removed ProdAgent API dependencies for DBS publication
Add support for LSF/CAF direct submission Re-establish a correct inheritance pattern for Scheruled* classes Start to remove some unneeded try: catch: statement and replace them with appropriate if:then: Erase any use of cfg_param as a common block (expecially in conjuction with apmon) and replace it with the user of task DB Several minor cleanup
changed a message about no produced output file
I have forgotten a + in txt
fixed problem with output copy and check output sandbox size
Don't move output file if copy_data (for OSG)
fixed a syntax mistake
add ls
Bug fix
Adjustement for output truncating
Bug fix on output limit
remove a double copy of output and added a symlink
Fixing problem on arguments check (WN)
Set FJR file correctly for newer CMSSW versions
fix bug in setting sourceSeed from cfg
remove support for DBS1, and remove DBS2 string from files/class
added echos
Remove references to GRID3
removed cmssw and architecture check for version <1_5_X
changes to improve crab stdout
Better error checking for temp directory
Remove EdmConfigToPythong dependency
fixed printout for pythia job
fixed printout for pythia job
fix bug in setting path for private executable on WN
fix bug in setting path for private executable on WN
fixed the output creation control
fixed the output creation control
fixed print message when a block is not found
fixed print message when a block is not found
Removing not proper print out
Removing not proper print out
Allowing submission for tasks with a missing location for a block - this is a first version that block the creation and then the submision of jobs assigned to no location
Recupering missing line on output_exit_status
Allowing submission for tasks with a missing location for a block - this is a first version that block the creation and then the submision of jobs assigned to no location
added control for output production before copy
added a control about output production, before copy
crab_1_5_4_slc3_pre1
the job report will be modified only for data publication
Added limit on output sandbox for glite -105 MB
Sending full datasetpath to the dashboard
Fix for jobs with a missing output file
bug fix for g4Seed and mixSeed in pset manipulator
added a output file check
bug fix in line 1192, removed additional '
re-added old changes
re-aligned version
very temporary roll-back
Stops job creation in case of arch-sw version mismatch
fix for user ui_working_dir
Problem setting up software for >= CMSSW_1_5_2 on OSG T2 sites solved. Introduced general setting of BUILD_ARCH before setting up the CMS software made the scramv1 p command not see the correct release area. Changed so that only in the case of LCG, this is set, on OSG, it is not.
add ARCH requirement also for SL3
moved parsing of black/white se lists into BlackWhiteListParser.py which is intantiated in SchedulerEdg.py: list parsing during xml creation cms_cmssw.py: list parsing for screen output also SchedulerGlite.py has to use new member functions and had to be changed
- added -e to cmsRun execution of CMSSW >= 1_5_X - fixed requirement for selecting software version and architecture in jdl, now, both are added to the jdl
cosmetics in checking output file existence
add SCRAM_ARCH as requirement in jdl _only_ for SL4. Plus cosmetics
fix sorting in screen output
not needed to modify FJR if publish_data = 0
export SCRAM_ARCH and BUILD_ARCH before source cmsset_default.sh execution
added slash in FOR_LFN
fixed tipe error --> modifyReport_result
changes to run ModifyJobReport in OSG env
printout removed
put additional input files into a tgz _and_ mods to cope with new syntax in pset for maxEvents for CMSSW>150
changed ApplicationFamily name
copy pset.cfg from $RUNTIMEAREA to topdir, lost in 1.90
changes to publish option
changes for DBS2 output publication
Changed job creation and output to be independent of black and white lists. All jobs are created after data discovery and job spitting using black and white lists independent from the entries in the destination list of jobs. If a job does not have any valid destination sites, it is nontheless created. After job splitting, a screen output listing all blocks and all jobs belonging to a block is shown listing available sites per block. The user can submit x number of jobs while CRAB will skip in addition to the normal job submission all jobs which have an empty destination list.
temporarly comment arch requiremtns for 153_pre1
re-introduce handling of various random seeds, lost in version 1.85
add VO-cms-$SCRAM_ARCH as a requirement to jdl
fix for reporting correct error code if output file is missing
export SCRAM_ARCH if grid is OSG
merge version 1.84 with 1.83
Again the fix on additional input files and relative paths
move setting of SCRAM_ARCH before actual creatio of scram project
move import of DBS/DLS stuff to beginning of DataDiscoveryAndLocation. They are needed noly there and beside provide a cleaner error message when trying to create w/o eval scram runtime
InputSandBox max size default to 9.5 Mb
removed some line added by cvs update?? (<<<<<<< cms_cmssw.py)
DBS2 is default
Fix for relative path for additional input files
input sandbox is set by default to 9.5 Mb: actual liit is 10, 1/2 Mb is for crab stuff
add handling for g4SimHits and mix random seeds and cleanup the sed manipulation in wrapper script
fix bug in checking existence of additional_input_files
DBS tag changed to latest tag: DBS_1_0_0_pre6 DBS default URL changed to : http://cmsdbsprod.cern.ch/cms_dbs_int_global/servlet/DBSServlet DBS api version changed to : v00_00_06 new format of datasetPath: /primaryDataset/processedDataset/Tier Parsing for ML report adapted accordingly
fixed case when user defined executable is not within the CMSSW user project area
User defined executable was packed into wrong subdirectory in default.tgz and could not be found on wn corrected to be packed on the top level of defalt.tgz
Provide check for default.tgz tarball size to limit input sandbox size for all submission systems. Condor_G has no sandbox size check as EGEE does, so a size check on the default.tgz tarball which now also contains the data directories was introduced. The default is 100 MB. It can be changed by: [EDG] maxtarballsize = X where X is the size in MB (float). The switch is not documented in the template crab.cfg not to mislead users.
set SCRAM_ARCH on WN to the one got from user environment
re-establish submission and execution of user script
Prevent crash in case of empty fileblock.
cms_cmssw.py
Add dbs_version card for steering the api version of the selected dbs server, default v00_00_05. Improved exception handling
Added card "use_dbs_2" to crab.cfg to steer usage of DBS-1 (default) or DBS-2
added last job splitting combination (event_per_job & number_of_job) for NON input mode
fix check existence of additional input files before doing anything with them
many minor fixes reported by pychecker, mostly unsude import and unused variables
cleanup
rewrite the creation of tar-ball files, using the tarfile module from python rather than opening a shell and do tar -czvf... Moreover, non need to copy anything in swArea for ProdAgent stuff not ML
add all data dir found in swarea to tar-ball
add option to tar to dereference symlinks
set relative path also for additional input files
added "first run" parameter managment for pythia job
add code version to DB
ML files added to the tgz
put sw version on TaskDB
fix to change name, adding _NUM, also to FJR
The FJR produced by each jobs is always returned via output sandbox even if the ouput is copied to a SE, as already happens for stdout and err
forgot printout
strip whitespaces from file names
added FrameworkJobReport crab_fjr.xml per default to output sandbox, copy ProdAgentApi directory from to swAera of local USER CMSSW project, include ProdAgentApi into PYTHONPATH on WN
cosmetics
commented out last call to analisys_common_info
remove common.analisys_common_info
correct variable replacement in CMSSW parameter-set
allow wildcards in additional input files
fixed splitting logic, added forgotten increase of counter, changed event caluclation for last job
working dir creation on OSG moved to mktemp
bug fix to write correctly xml in case of None datasetPath jobs
BOSS4 + sub-file splitting + taskDB
minor bug fix
add iterator for xml and other changes related to the xml sintax
Fixed cmsset.sh dir for OSG sites
fixed argument replacing in pset
introduced sub fileblock splitting and added discovery only for requested jobs (-create 100)
first changes to use iterator
Gets events/file. Requires cvs head of DBS/Clients/PythonAPI
targetSite sent to dashboard is job specific
Fixed parsing bug
Commit for Malina: Handled block-splitting for CMSSW and CE->SE in .jdl files
Burt Holzman fix for arguments and VO_CMS_SW_DIR/cmsset_default.sh
Burt Holzman fix for arguments and VO_CMS_SW_DIR/cmsset_default.sh
Burt Holzman fix for arguments and /uscmst1/prod/sw/cms/cmsset_default.sh
fix for xml syntax using cmssw
Shell script strips full path from additional_input_files and uses only the file name for copying from RUNTIME_AREA to the scram project directory
printout removed
additional_input_files is in USER section, not in CMSSW
cosmetics for VTx set
add manipulation VtxSmeared
copy pset to pset.cfg even if untouched: bugfix
better messages
copy pset to pset.cfg even if untouched
fix problem in splitting and modify seed only if defined
better job splitting for cmssw
splitting per events
simplify and fix handling of list in JobDB
fix probem if ask more events than available
fixed bug related to total number of file and jobs
fix problem in job arguments definition
fix bug if asked all events but more files per job than total available
minor fix in printout
removed GridSyncJobId from the fields which are reported every time from the WN script to the DashBoard
changes WN wrapper script ML reporting to report only the relevant information (repo is deleted after report to ML) at any stage of WN script
changed common.logger.message for total_number_of_events
commented a print
support for None input
support for "none" datasetpath" and split by events
added taskId to be reported at Submission time
moved initial DashBoard reporting to scheduler, added failure codes for OSG WN tmp directory removal, fixed bugs in DashBoard exitcode reporting (no spaces)
moved SyncGridId creation in wrapper script to front that error exit codes during job setup are reported correctly to the DashBoard
added ML reporting of application, dataset, owner and fixed monitoring for status check for condor-g
add optional (secret) switches for DLS type (both in ORCA and CMSSW jobtype) and for DBS instances (in CMSSW jobtype).
Integrate CMSSW: added OSG capabilities
bug fix in copying multiple files on SE. BTW same code in 4 different classescvscheckcvscheck BAD!
initial version of cmssw jobtype
This form allows you to request diffs between any two revisions of this file. For each of the two "sides" of the diff, select a symbolic revision name using the selection box, or choose 'Use Text Field' and enter a numeric revision.