Test of CMSSW_3_10_0_pre7io

Tests used: CPT with the MTR3 job, plus a simple job configuration for the GSIdcap test (added by Christoph to Leo's page)

Site | SE Technology | MTR3 32bit | MTR3 64bit
T2_BE_IIHE | dCache | smile |
T2_CH_CSCS | dCache | smile |
T2_ES_CIEMAT | dCache | smile |
T2_FR_GRIF_LLR | dpm | frown |
T2_IT_Bari | Lustre+Storm | smile |
T2_DE_DESY | dCache(GSIdcap) | frown | frown
Overall | | frown | frown

Summary:

  • DPM still has issues without Sartirana's hack
  • The 64bit release breaks:
    • gLite, due to a badly (?) compiled libssl
    • CRAB, due to the shipped python libraries v 2.6.4-cms9 (2.6.4-cms6 works)
  • dCache with GSIdcap still does not work properly

32 bit

 | T2_BE_IIHE 20101206 | T2_BE_IIHE 20101206 | T2_CH_CSCS 20101206 | T2_CH_CSCS 20101206 | T2_ES_CIEMAT 20101206 | T2_ES_CIEMAT 20101206 | T2_FR_GRIF_LLR 20101206 | T2_FR_GRIF_LLR 20101206 | T2_IT_Bari 20101206 | T2_IT_Bari 20101206
Success | 100.0% (20 / 20) | 100.0% (20 / 20) | 100.0% (20 / 20) | 100.0% (20 / 20) | 100.0% (20 / 20) | 100.0% (20 / 20) | 80.0% (16 / 20) | 80.0% (16 / 20) | 55.0% (11 / 20) | 55.0% (11 / 20)
Error 8020 | // | // | // | // | // | // | 20.0% | 20.0% | 45.0% | 45.0%
CpuPercentage | 60.30 +- 6.76 | 37.35 +- 19.29 | 58.25 +- 1.95 | 53.50 +- 3.04 | 55.10 +- 2.62 | 40.35 +- 1.77 | 73.69 +- 1.10 | 68.62 +- 7.25 | 59.27 +- 13.70 | 66.55 +- 5.05
TimeJob_AvgEvent | 0.07 +- 0.01 | 0.20 +- 0.12 | 0.12 +- 0.01 | 0.12 +- 0.01 | 0.06 +- 0.00 | 0.08 +- 0.00 | 0.06 +- 0.00 | 0.08 +- 0.01 | 0.11 +- 0.03 | 0.08 +- 0.01
TimeJob_Exe | 697.35 +- 81.80 | 1983.55 +- 1234.27 | 1184.80 +- 102.38 | 1246.45 +- 101.34 | 596.50 +- 30.37 | 835.70 +- 39.09 | 579.81 +- 11.63 | 849.62 +- 143.39 | 1081.82 +- 258.96 | 819.91 +- 126.14
TimeJob_MaxEvent | 16.29 +- 5.75 | 407.83 +- 501.33 | 15.78 +- 0.87 | 17.01 +- 1.15 | 12.42 +- 2.37 | 19.48 +- 11.42 | 4.17 +- 1.06 | 12.44 +- 5.11 | 39.86 +- 65.53 | 10.10 +- 3.55
TimeJob_MinEvent | 0.01 +- 0.00 | 0.01 +- 0.00 | 0.01 +- 0.00 | 0.01 +- 0.00 | 0.01 +- 0.00 | 0.01 +- 0.00 | 0.01 +- 0.00 | 0.01 +- 0.00 | 0.01 +- 0.00 | 0.01 +- 0.00
TimeJob_Stageout | -1.00 +- 0.00 | -1.00 +- 0.00 | -1.00 +- 0.00 | -1.00 +- 0.00 | -1.00 +- 0.00 | -1.00 +- 0.00 | -1.00 +- 0.00 | -1.00 +- 0.00 | -1.00 +- 0.00 | -1.00 +- 0.00
TimeJob_Sys | 16.48 +- 3.09 | 10.48 +- 1.03 | 10.96 +- 0.87 | 11.67 +- 1.39 | 7.04 +- 0.88 | 8.73 +- 0.64 | 8.89 +- 1.16 | 12.45 +- 2.45 | 20.32 +- 6.01 | 14.27 +- 2.93
TimeJob_TotalJob | 668.64 +- 80.75 | 1961.43 +- 1234.12 | 1166.93 +- 102.12 | 1231.38 +- 101.13 | 577.26 +- 29.18 | 816.41 +- 36.05 | 565.88 +- 10.57 | 834.69 +- 141.44 | 1063.38 +- 254.86 | 807.21 +- 125.55
TimeJob_User | 402.27 +- 5.37 | 505.94 +- 67.63 | 686.35 +- 66.29 | 660.49 +- 61.41 | 323.99 +- 6.12 | 331.39 +- 9.06 | 421.67 +- 6.50 | 575.47 +- 122.99 | 593.29 +- 66.37 | 540.31 +- 124.12
TimeJob_Wrapper | 710.20 +- 80.10 | 1997.10 +- 1233.99 | 1202.40 +- 101.82 | 1262.30 +- 100.93 | 620.20 +- 19.51 | 848.60 +- 40.83 | 601.81 +- 4.39 | 864.00 +- 146.64 | 1112.09 +- 279.33 | 832.27 +- 126.45
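
For anyone who wants to post-process the statistics above, a row can be split into (mean, error) pairs with a small regular expression. This is only a sketch: the function name parse_row is made up for illustration, and it assumes every numeric cell has the "value +- error" form used in the table.

```python
import re

# One "value +- error" cell, e.g. "697.35 +- 81.80" or "-1.00 +- 0.00"
CELL = re.compile(r"(-?\d+(?:\.\d+)?) \+- (\d+(?:\.\d+)?)")


def parse_row(line):
    """Return (label, [(mean, error), ...]) for one row of the statistics table."""
    label = line.split()[0]
    values = [(float(mean), float(err)) for mean, err in CELL.findall(line)]
    return label, values


row = ("TimeJob_Exe 697.35 +- 81.80 1983.55 +- 1234.27 1184.80 +- 102.38 "
       "1246.45 +- 101.34 596.50 +- 30.37 835.70 +- 39.09 579.81 +- 11.63 "
       "849.62 +- 143.39 1081.82 +- 258.96 819.91 +- 126.14")
label, values = parse_row(row)
print(label, len(values), values[0])
```

Rows without such cells (Success, Error 8020) simply yield an empty list of values.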

Notes:

  • Bari failed because the files are not physically available (thanks Giacinto!)
  • GRIF had a similar issue, solved (thanks Andrea!)

DPM crashes without A.Sartirana's workaround

Actually, on DPM the release does not work without A. Sartirana's workaround. To avoid the hack, a customized job was sent to GRIF:

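#!/bin/bash

LOG="cmssw"

# Set up the CMSSW runtime environment
eval `scram ru -sh`

# Dump the environment before and after removing the directory with the
# "hacked" libraries from LD_LIBRARY_PATH
env
export LD_LIBRARY_PATH=`echo $LD_LIBRARY_PATH  | sed "s=/opt/exp_soft/cms/mylib:==g" `
echo "--------------------------------"
env

# Run the job, writing the job report to ${LOG}.xml
cmsRun -j ${LOG}.xml pset.py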

where /opt/exp_soft/cms/mylib is where the "hacked" libraries are. Without them, a new error appears:

07-Dec-2010 19:23:24 CET  Initiating request to open file rfio:/dpm/in2p3.fr/home/cms/trivcat/store/mc/JobRobot/RelValProdTTbar/GEN-SIM-DIGI-RECO/MC_3XY_V24_JobRobot-v1/0000/D005BB56-CA2B-DF11-BA08-0030487C60AE.root
%MSG-e Root_Error:  file_open TUnixSystem::DispatchSignals()  07-Dec-2010 19:23:24 CET pre-events
segmentation violation
%MSG
Attaching to program: /proc/16136/exe, process 16136
[Thread debugging using libthread_db enabled]
[New Thread 0xf3953b90 (LWP 16138)]
[New Thread 0xf4354b90 (LWP 16137)]
0xffffe410 in __kernel_vsyscall ()
Thread 3 (Thread 0xf4354b90 (LWP 16137)):
#0  0xffffe410 in __kernel_vsyscall ()
#1  0xf57fdc4e in do_sigwait () from /lib/libpthread.so.0
#2  0xf57fdcef in sigwait () from /lib/libpthread.so.0
#3  0xf45290c1 in globus_l_callback_thread_signal_poll (user_arg=0x0)
    at globus_callback_threads.c:2841
#4  0xf4540293 in thread_starter (temparg=0xa46a450)
    at globus_thread_pthreads.c:508
#5  0xf57f5832 in start_thread () from /lib/libpthread.so.0
#6  0x00704f6e in clone () from /lib/libc.so.6

...
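
The sed expression in the job script above removes the directory only where it is followed by a colon, so it would miss an entry sitting at the very end of LD_LIBRARY_PATH. A slightly more defensive way to do the same filtering is sketched below in Python; the helper name drop_path_entry is made up for illustration and is not part of the test job.

```python
import os


def drop_path_entry(path_value, unwanted):
    """Remove every occurrence of one directory from a colon-separated path list."""
    unwanted = unwanted.rstrip("/")
    kept = [entry for entry in path_value.split(":")
            if entry.rstrip("/") != unwanted]
    return ":".join(kept)


# Apply it to the current environment (the variable may be unset, hence the default)
cleaned = drop_path_entry(os.environ.get("LD_LIBRARY_PATH", ""),
                          "/opt/exp_soft/cms/mylib")
print(cleaned)
```

Comparing whole entries rather than substrings also avoids touching directories that merely start with the same prefix.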

dCache with GSIdcap

File access fails with authentication problems in GSIdcap. The problem appears only if the openssl libraries are loaded from the CMS software area. In cases where cmsRun does not access Frontier (not a realistic use case), the openssl libraries are loaded from the OS and GSIdcap works.

Using Frontier the failure looks like this:

Error ( POLLIN POLLERR POLLHUP) (with data) on control line [119]
Failed to create a control line
Failed open file in the dCache.
%MSG-w StorageFactory::stagein():  PoolSource:source@sourceConstruction  16-Dec-2010 11:03:02 CET pre-events
Failed to stage in file 'gsidcap://dcache-cms-gsidcap.desy.de:22128//pnfs/desy.de/cms/tier2/store/mc/JobRobot/RelValProdTTbar/GEN-SIM-DIGI-RECO/MC_3XY_V24_JobRobot-v1/0000//0EDC7961-822C-DF11-A0BE-001617E30E2C.root' because:
---- DCacheStorageMaker::stagein() BEGIN
Cannot stage in file 'gsidcap://dcache-cms-gsidcap.desy.de:22128//pnfs/desy.de/cms/tier2/store/mc/JobRobot/RelValProdTTbar/GEN-SIM-DIGI-RECO/MC_3XY_V24_JobRobot-v1/0000//0EDC7961-822C-DF11-A0BE-001617E30E2C.root', error was: Server rejected "hello" (dc_errno=26)
---- DCacheStorageMaker::stagein() END

%MSG
16-Dec-2010 11:03:02 CET  Initiating request to open file gsidcap://dcache-cms-gsidcap.desy.de:22128//pnfs/desy.de/cms/tier2/store/mc/JobRobot/RelValProdTTbar/GEN-SIM-DIGI-RECO/MC_3XY_V24_JobRobot-v1/0000//0EDC7961-822C-DF11-A0BE-001617E30E2C.root
Error ( POLLIN POLLERR POLLHUP) (with data) on control line [119]
Failed to create a control line
Failed open file in the dCache.
%MSG-s CMSException:  AfterFile 16-Dec-2010 11:03:02 CET pre-events
cms::Exception caught in cmsRun
---- FileOpenError BEGIN
---- StorageFactory::open() BEGIN
Failed to open the file 'gsidcap://dcache-cms-gsidcap.desy.de:22128//pnfs/desy.de/cms/tier2/store/mc/JobRobot/RelValProdTTbar/GEN-SIM-DIGI-RECO/MC_3XY_V24_JobRobot-v1/0000/0EDC7961-822C-DF11-A0BE-001617E30E2C.root' because:
---- DCacheFile::open() BEGIN
dc_open(name='gsidcap://dcache-cms-gsidcap.desy.de:22128//pnfs/desy.de/cms/tier2/store/mc/JobRobot/RelValProdTTbar/GEN-SIM-DIGI-RECO/MC_3XY_V24_JobRobot-v1/0000/0EDC7961-822C-DF11-A0BE-001617E30E2C.root', flags=0x0, permissions=0666) => error 'Server rejected "hello"' (dc_errno=26)
---- DCacheFile::open() END
---- StorageFactory::open() END

RootInputFileSequence::initFile(): Input file gsidcap://dcache-cms-gsidcap.desy.de:22128//pnfs/desy.de/cms/tier2/store/mc/JobRobot/RelValProdTTbar/GEN-SIM-DIGI-RECO/MC_3XY_V24_JobRobot-v1/0000//0EDC7961-822C-DF11-A0BE-001617E30E2C.root was not found or could not be opened.

Error occurred while creating source PoolSource
---- FileOpenError END


%MSG

I did not manage to get this working by placing symbolic links to the OS openssl libraries (strange).

64 bit

Forcing 64 SCRAM arch with: export SCRAM_ARCH=slc5_amd64_gcc434

libssl issue

The libssl.so library shipped with the release makes voms-proxy-* segfault, so something in the build seems to be wrong. Cross-check: substituting slc5_amd64_gcc434/external/openssl/0.9.8e-cms2/lib/libssl.so with /lib64/libssl.so.6 solves the issue. Both are the same version; the only visible difference is that the system library is stripped and the shipped one is not:

[leo@t3ui02 lib]$ file /lib64/libssl.so.0.9.8e
/lib64/libssl.so.0.9.8e: ELF 64-bit LSB shared object, AMD x86-64, version 1 (SYSV), stripped
[leo@t3ui02 lib]$ file /shome/leo/CMSSW_RELEASES/slc5_amd64_gcc434/external/openssl/0.9.8e-cms2/lib/libssl.so.6
/shome/leo/CMSSW_RELEASES/slc5_amd64_gcc434/external/openssl/0.9.8e-cms2/lib/libssl.so.6: ELF 64-bit LSB shared object, AMD x86-64, version 1 (SYSV), not stripped
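
When several libraries have to be compared in this way, the "stripped" / "not stripped" flag can be read from the output of file with a few lines of Python. This is just a sketch that parses the text printed above; the function name is_stripped is an illustrative choice, and it only looks at the comma-separated description, not at the ELF file itself.

```python
def is_stripped(file_output):
    """Return True/False from one line of `file` output, or None if no flag is present."""
    # Everything after the first colon is the description of the file
    _, _, description = file_output.partition(":")
    flags = [part.strip() for part in description.split(",")]
    if "not stripped" in flags:
        return False
    if "stripped" in flags:
        return True
    return None


system_lib = ("/lib64/libssl.so.0.9.8e: ELF 64-bit LSB shared object, "
              "AMD x86-64, version 1 (SYSV), stripped")
print(is_stripped(system_lib))
```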

CRAB segfaults

After the substitution, issues with CRAB:

[leo@t3ui02 test]$ crab -create -submit
crab:  Version 2.7.5 running on Wed Dec  8 15:57:50 2010 CET (14:57:50 UTC)

crab. Working options:
        scheduler           glite
        job type            CMSSW
        server              OFF
        working directory   /shome/leo/IntegrationTests/CMSSW_3_10_0_pre7io_64bit/src/Tests/TestJobs/test/T2_CH_CSCS-MTR3-RelValProdTTbarJobRobotMC_3XY_V24_JobRobotv1-10000-CMSSW_3_10_0_pre7io_64bit-201012081029/

/swshare/CRAB/CRAB_2_7_5_patch1/python/crab: line 34:   603 Segmentation fault      python $CRABPYTHON/crab.py $*

Using the pdb module:

cd /shome/leo/IntegrationTests/CMSSW_3_10_0_pre7io_64bit/src/Tests/TestJobs/test
gridinit
export SCRAM_ARCH=slc5_amd64_gcc434
export VO_CMS_SW_DIR=/shome/leo/CMSSW_RELEASES/
source /shome/leo/CMSSW_RELEASES/cmsset_default.sh
cmsenv
crabinit
source /swshare/CRAB/CRAB_2_7_5_patch1/python/crab
[leo@t3ui04 test]$ python -m pdb $CRABDIR/python/crab.py -create -submit
> /swshare/CRAB/CRAB_2_7_5_patch1/python/crab.py(2)<module>()
-> import sys, os, time, string
(Pdb) b /swshare/CRAB/CRAB_2_7_5_patch1/external/ProdCommon/SiteDB/CmsSiteMapper.py:212
Breakpoint 1 at /swshare/CRAB/CRAB_2_7_5_patch1/external/ProdCommon/SiteDB/CmsSiteMapper.py:212
(Pdb) c
crab:  Version 2.7.5 running on Fri Dec 10 12:33:23 2010 CET (11:33:23 UTC)

crab. Working options:
        scheduler           glite
        job type            CMSSW
        server              OFF
        working directory   /shome/leo/IntegrationTests/CMSSW_3_10_0_pre7io_64bit/src/Tests/TestJobs/test/T2_CH_CSCS-MTR3-RelValProdTTbarJobRobotMC_3XY_V24_JobRobotv1-10000-CMSSW_3_10_0_pre7io_64bit-201012081727/

crab:  WARNING: CMSSW_3_10_0_pre7io on slc5_amd64_gcc434 is not a supported release. Submission may fail.
crab:  Contacting Data Discovery Services ...
crab:  Accessing DBS at: http://cmsdbsprod.cern.ch/cms_dbs_prod_global/servlet/DBSServlet
crab:  Requested dataset: /RelValProdTTbar/JobRobot-MC_3XY_V24_JobRobot-v1/GEN-SIM-DIGI-RECO has 300000 events in 1 blocks.

> /swshare/CRAB/CRAB_2_7_5_patch1/external/ProdCommon/SiteDB/CmsSiteMapper.py(212)load()

...

> /swshare/CRAB/CRAB_2_7_5_patch1/external/ProdCommon/SiteDB/SiteDBReport.py(123)load_report()
-> def load_report(self, report):
(Pdb) l
118             params = urllib.urlencode(params)
119             fd = urllib2.urlopen(self.sitedb_url, params)
120             return fd
121
122
123  ->     def load_report(self, report):
124             """
125             Load the SiteDB report and return the DOM contents.
126
127             Great care is taken to make this resilient, including timeouts and
128             fallback to the contents of the cache.
(Pdb)
129
130             @param report: The name of the SiteDB report
131             @returns: The DOM object representing the contents of the report
132             """
133
134             # Start off by setting the alarm clock to timeout faulty operations.
135             def interrupt_op(*args):
136                 raise AlarmClock()
137             signal.signal(signal.SIGALRM, interrupt_op)
138             try:
139                 # Check the contents of the local cache; only return here if they
(Pdb)
140                 # are fresh and parse cleanly.
141                 try:
142                     signal.alarm(self.alarm_timeout)
143                     fresh, results = self.check_cache(report)
144                     if results:
145                         try:
146                             results = parse(results)
147                             if fresh:
148                                 return results
149                         except:
150                             results = None
(Pdb) n
> /swshare/CRAB/CRAB_2_7_5_patch1/external/ProdCommon/SiteDB/SiteDBReport.py(135)load_report()
-> def interrupt_op(*args):
(Pdb)
> /swshare/CRAB/CRAB_2_7_5_patch1/external/ProdCommon/SiteDB/SiteDBReport.py(137)load_report()
-> signal.signal(signal.SIGALRM, interrupt_op)
(Pdb)
> /swshare/CRAB/CRAB_2_7_5_patch1/external/ProdCommon/SiteDB/SiteDBReport.py(138)load_report()
-> try:
(Pdb)
> /swshare/CRAB/CRAB_2_7_5_patch1/external/ProdCommon/SiteDB/SiteDBReport.py(141)load_report()
-> try:
(Pdb)
> /swshare/CRAB/CRAB_2_7_5_patch1/external/ProdCommon/SiteDB/SiteDBReport.py(142)load_report()
-> signal.alarm(self.alarm_timeout)
(Pdb)
> /swshare/CRAB/CRAB_2_7_5_patch1/external/ProdCommon/SiteDB/SiteDBReport.py(143)load_report()
-> fresh, results = self.check_cache(report)
(Pdb)
> /swshare/CRAB/CRAB_2_7_5_patch1/external/ProdCommon/SiteDB/SiteDBReport.py(144)load_report()
-> if results:
(Pdb)
> /swshare/CRAB/CRAB_2_7_5_patch1/external/ProdCommon/SiteDB/SiteDBReport.py(145)load_report()
-> try:
(Pdb)
> /swshare/CRAB/CRAB_2_7_5_patch1/external/ProdCommon/SiteDB/SiteDBReport.py(146)load_report()
-> results = parse(results)
(Pdb)
> /swshare/CRAB/CRAB_2_7_5_patch1/external/ProdCommon/SiteDB/SiteDBReport.py(147)load_report()
-> if fresh:
(Pdb)
> /swshare/CRAB/CRAB_2_7_5_patch1/external/ProdCommon/SiteDB/SiteDBReport.py(156)load_report()
-> try:
(Pdb)
> /swshare/CRAB/CRAB_2_7_5_patch1/external/ProdCommon/SiteDB/SiteDBReport.py(157)load_report()
-> signal.alarm(self.alarm_timeout)
(Pdb)
> /swshare/CRAB/CRAB_2_7_5_patch1/external/ProdCommon/SiteDB/SiteDBReport.py(158)load_report()
-> urlresults = self.load_siteDB(report)
(Pdb)
> /swshare/CRAB/CRAB_2_7_5_patch1/external/ProdCommon/SiteDB/SiteDBReport.py(159)load_report()
-> try:
(Pdb)
> /swshare/CRAB/CRAB_2_7_5_patch1/external/ProdCommon/SiteDB/SiteDBReport.py(160)load_report()
-> urlresults = parse(urlresults)
(Pdb) s
--Call--
> /shome/leo/CMSSW_RELEASES/slc5_amd64_gcc434/external/python/2.6.4-cms9/lib/python2.6/xml/dom/minidom.py(1914)parse()
-> def parse(file, parser=None, bufsize=None):
(Pdb) l
1909        toktype, rootNode = events.getEvent()
1910        events.expandNode(rootNode)
1911        events.clear()
1912        return rootNode
1913
1914 -> def parse(file, parser=None, bufsize=None):
1915        """Parse a file into a DOM by filename or file object."""
1916        if parser is None and not bufsize:
1917            from xml.dom import expatbuilder
1918            return expatbuilder.parse(file)
1919        else:
(Pdb)
1920            from xml.dom import pulldom
1921            return _do_pulldom_parse(pulldom.parse, (file,),
1922                {'parser': parser, 'bufsize': bufsize})
1923
1924    def parseString(string, parser=None):
1925        """Parse a file into a DOM from a string."""
1926        if parser is None:
1927            from xml.dom import expatbuilder
1928            return expatbuilder.parseString(string)
1929        else:
1930            from xml.dom import pulldom
(Pdb) l
1931            return _do_pulldom_parse(pulldom.parseString, (string,),
1932                                     {'parser': parser})
1933
1934    def getDOMImplementation(features=None):
1935        if features:
1936            if isinstance(features, StringTypes):
1937                features = domreg._parse_feature_string(features)
1938            for f, v in features:
1939                if not Document.implementation.hasFeature(f, v):
1940                    return None
1941        return Document.implementation
(Pdb) n
> /shome/leo/CMSSW_RELEASES/slc5_amd64_gcc434/external/python/2.6.4-cms9/lib/python2.6/xml/dom/minidom.py(1916)parse()
-> if parser is None and not bufsize:
(Pdb)
> /shome/leo/CMSSW_RELEASES/slc5_amd64_gcc434/external/python/2.6.4-cms9/lib/python2.6/xml/dom/minidom.py(1917)parse()
-> from xml.dom import expatbuilder
(Pdb)
> /shome/leo/CMSSW_RELEASES/slc5_amd64_gcc434/external/python/2.6.4-cms9/lib/python2.6/xml/dom/minidom.py(1918)parse()
-> return expatbuilder.parse(file)
(Pdb)
Segmentation fault

The issue seems to be located in the python libraries shipped with CMSSW_3_10_0_pre7io, which are version 2.6.4-cms9. As a test, substituting them in LD_LIBRARY_PATH with 2.6.4-cms6 (shipped with CMSSW_3_8_6) makes crab work.
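
Since the pdb session ends inside minidom.parse, the same failure can probably be checked without CRAB by running a tiny XML parse in a child process, so that a crash does not take the calling shell or script down with it. The sketch below is written for a current Python 3 (the release above ships Python 2.6, where subprocess.run does not exist); the function name probe_xml_parser and the idea of passing a modified environment are illustrative assumptions, not something taken from the tests.

```python
import signal
import subprocess
import sys


def probe_xml_parser(python_exe=sys.executable, env=None):
    """Parse a trivial XML document in a child interpreter and classify the outcome."""
    snippet = "from xml.dom import minidom; minidom.parseString('<a/>')"
    proc = subprocess.run(
        [python_exe, "-c", snippet],
        env=env,                      # e.g. a copy of os.environ with a changed LD_LIBRARY_PATH
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    if proc.returncode == 0:
        return "ok"
    # On POSIX a negative return code means the child was killed by that signal
    if proc.returncode == -signal.SIGSEGV:
        return "segfault"
    return "failed (exit code %d)" % proc.returncode


print(probe_xml_parser())
```

Running the probe once per candidate environment would show which set of libraries triggers the crash.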

dCache with GSIdcap

This fails with a segmentation fault regardless of whether Frontier is involved. The interesting difference regarding GSIdcap is that the 32bit libgsiTunnel.so is not dynamically linked against openssl, while the 64bit flavour is:

ldd /opt/d-cache/dcap/lib/libgsiTunnel.so 
ldd: warning: you do not have execution permission for `/opt/d-cache/dcap/lib/libgsiTunnel.so'
        linux-gate.so.1 =>  (0xffffe000)
        libdl.so.2 => /lib/libdl.so.2 (0xf7de9000)
        libcrypt.so.1 => /lib/libcrypt.so.1 (0xf7db7000)
        libresolv.so.2 => /lib/libresolv.so.2 (0xf7da2000)
        libc.so.6 => /lib/libc.so.6 (0xf7c49000)
        /lib/ld-linux.so.2 (0x00bac000)

sgmcms@t2-cms-vo2: [~/Testing/slc5_amd64_gcc434/CMSSW_3_10_0_pre7io/src] ldd /opt/d-cache/dcap/lib64/libgsiTunnel.so 
        libglobus_gssapi_gsi_gcc64pthr.so.0 => /opt/globus/lib/libglobus_gssapi_gsi_gcc64pthr.so.0 (0x00002b62584ad000)
        libglobus_gsi_callback_gcc64pthr.so.0 => /opt/globus/lib/libglobus_gsi_callback_gcc64pthr.so.0 (0x00002b62586c5000)
        libglobus_gsi_cert_utils_gcc64pthr.so.0 => /opt/globus/lib/libglobus_gsi_cert_utils_gcc64pthr.so.0 (0x00002b62588cf000)
        libglobus_gsi_proxy_core_gcc64pthr.so.0 => /opt/globus/lib/libglobus_gsi_proxy_core_gcc64pthr.so.0 (0x00002b6258ad4000)
        libglobus_gsi_credential_gcc64pthr.so.0 => /opt/globus/lib/libglobus_gsi_credential_gcc64pthr.so.0 (0x00002b6258ce2000)
        libglobus_openssl_gcc64pthr.so.0 => /opt/globus/lib/libglobus_openssl_gcc64pthr.so.0 (0x00002b6258ef1000)
        libglobus_gsi_sysconfig_gcc64pthr.so.0 => /opt/globus/lib/libglobus_gsi_sysconfig_gcc64pthr.so.0 (0x00002b62590f3000)
        libglobus_openssl_error_gcc64pthr.so.0 => /opt/globus/lib/libglobus_openssl_error_gcc64pthr.so.0 (0x00002b62592fe000)
        libglobus_oldgaa_gcc64pthr.so.0 => /opt/globus/lib/libglobus_oldgaa_gcc64pthr.so.0 (0x00002b6259502000)
        libglobus_proxy_ssl_gcc64pthr.so.0 => /opt/globus/lib/libglobus_proxy_ssl_gcc64pthr.so.0 (0x00002b625970a000)
        libglobus_common_gcc64pthr.so.0 => /opt/globus/lib/libglobus_common_gcc64pthr.so.0 (0x00002b625990f000)
        libz.so.1 => /opt/vo/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_3_10_0_pre7io/external/slc5_amd64_gcc434/lib/libz.so.1 (0x00002b6259b4c000)
        libc.so.6 => /lib64/libc.so.6 (0x00002b6259b8a000)
        libltdl_gcc64pthr.so.3 => /opt/globus/lib/libltdl_gcc64pthr.so.3 (0x00002b6259ee1000)
        libssl.so.6 => /opt/vo/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_3_10_0_pre7io/external/slc5_amd64_gcc434/lib/libssl.so.6 (0x00002b625a0e9000)
        libcrypto.so.6 => /opt/vo/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_3_10_0_pre7io/external/slc5_amd64_gcc434/lib/libcrypto.so.6 (0x00002b625a132000)
        libdl.so.2 => /lib64/libdl.so.2 (0x00002b625a295000)
        libpthread.so.0 => /lib64/libpthread.so.0 (0x00002b625a49a000)
        /lib64/ld-linux-x86-64.so.2 (0x0000003242a00000)
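
To spot quickly which dependencies are picked up from the CMS software area rather than from the OS, the ldd output can be filtered by path prefix. The following is a minimal sketch that works on the text of the listing above; the function names are made up for illustration, and the sample holds only a few lines copied from that listing.

```python
import re

# Matches lines such as "libssl.so.6 => /path/to/libssl.so.6 (0x00002b625a0e9000)"
LDD_LINE = re.compile(r"^\s*(\S+)\s+=>\s+(\S+)\s+\(0x[0-9a-fA-F]+\)\s*$")


def resolved_libraries(ldd_output):
    """Map each library name to the path ldd resolved it to."""
    libs = {}
    for line in ldd_output.splitlines():
        match = LDD_LINE.match(line)
        if match:  # lines without a resolved path (warnings, the loader) are skipped
            libs[match.group(1)] = match.group(2)
    return libs


def libraries_under(ldd_output, prefix):
    """Names of the libraries resolved to a path below the given prefix."""
    return sorted(name for name, path in resolved_libraries(ldd_output).items()
                  if path.startswith(prefix))


sample = """\
        libz.so.1 => /opt/vo/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_3_10_0_pre7io/external/slc5_amd64_gcc434/lib/libz.so.1 (0x00002b6259b4c000)
        libc.so.6 => /lib64/libc.so.6 (0x00002b6259b8a000)
        libssl.so.6 => /opt/vo/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_3_10_0_pre7io/external/slc5_amd64_gcc434/lib/libssl.so.6 (0x00002b625a0e9000)
        libcrypto.so.6 => /opt/vo/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_3_10_0_pre7io/external/slc5_amd64_gcc434/lib/libcrypto.so.6 (0x00002b625a132000)
        /lib64/ld-linux-x86-64.so.2 (0x0000003242a00000)
"""
print(libraries_under(sample, "/opt/vo/cms/"))
```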

cmsRun actually crashes with a traceback rather similar to the DPM case:

%MSG-e Root_Error:  PoolSource:source@sourceConstruction  TUnixSystem::DispatchSignals() 16-Dec-2010 11:08:42 CET pre-events
segmentation violation
%MSG
Attaching to program: /proc/5979/exe, process 5979
[Thread debugging using libthread_db enabled]
[New Thread 0x4206f940 (LWP 5981)]
[New Thread 0x4166e940 (LWP 5980)]
0x0000003242e99fff in waitpid () from /lib64/libc.so.6
Thread 3 (Thread 0x4166e940 (LWP 5980)):
#0  0x0000003243a0e838 in do_sigwait () from /lib64/libpthread.so.0
#1  0x0000003243a0e8dd in sigwait () from /lib64/libpthread.so.0
#2  0x00002b0d699dede5 in globus_l_callback_thread_signal_poll ()
   from /opt/globus/lib/libglobus_common_gcc64pthr.so.0
#3  0x00002b0d699f2c58 in thread_starter ()
   from /opt/globus/lib/libglobus_common_gcc64pthr.so.0
#4  0x0000003243a0673d in start_thread () from /lib64/libpthread.so.0
#5  0x0000003242ed3f6d in clone () from /lib64/libc.so.6
#6  0x0000000000000000 in ?? ()

Thread 2 (Thread 0x4206f940 (LWP 5981)):
#0  0x0000003243a0aee9 in pthread_cond_wait@@GLIBC_2.3.2 ()
   from /lib64/libpthread.so.0
#1  0x00002b0d699f2696 in globus_cond_wait ()
   from /opt/globus/lib/libglobus_common_gcc64pthr.so.0
#2  0x00002b0d699dfddd in globus_l_callback_thread_poll ()
   from /opt/globus/lib/libglobus_common_gcc64pthr.so.0
#3  0x00002b0d699f2edf in globus_l_thread_pool_thread_start ()
   from /opt/globus/lib/libglobus_common_gcc64pthr.so.0
#4  0x00002b0d699f2c58 in thread_starter ()
   from /opt/globus/lib/libglobus_common_gcc64pthr.so.0
#5  0x0000003243a0673d in start_thread () from /lib64/libpthread.so.0
#6  0x0000003242ed3f6d in clone () from /lib64/libc.so.6
#7  0x0000000000000000 in ?? ()

Thread 1 (Thread 0x2b0d67067390 (LWP 5979)):
#0  0x0000003242e99fff in waitpid () from /lib64/libc.so.6
#1  0x0000003242e3c331 in do_system () from /lib64/libc.so.6
#2  0x0000003242e3c687 in system () from /lib64/libc.so.6
#3  0x00002b0d65b32a65 in TUnixSystem::StackTrace() ()
   from /opt/vo/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_3_10_0_pre7io/external/slc5_amd64_gcc434/lib/libCore.so
#4  0x00002b0d65b33605 in TUnixSystem::DispatchSignals(ESignals) ()
   from /opt/vo/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_3_10_0_pre7io/external/slc5_amd64_gcc434/lib/libCore.so
#5  <signal handler called>
#6  SSL_CIPHER_description (cipher=0x735b442e1905f2e0, 
    buf=0x7fff41ad4b00 "H*\025C2", len=256) at ssl_ciph.c:1110
#7  0x00002b0d68579001 in globus_i_gsi_gss_handshake ()
   from /opt/globus/lib/libglobus_gssapi_gsi_gcc64pthr.so.0
#8  0x00002b0d68574492 in gss_init_sec_context ()
   from /opt/globus/lib/libglobus_gssapi_gsi_gcc64pthr.so.0
#9  0x00002b0d68357507 in eInit () from /opt/d-cache/dcap/lib64/libgsiTunnel.so
#10 0x00002b0d6834006e in cache_connect (srv=0xa3c8120) at dcap.c:751
#11 0x00002b0d68340568 in serverConnect (node=0xa3c85a0) at dcap.c:569
#12 initControlLine (node=0xa3c85a0) at dcap.c:306
#13 0x00002b0d683412f4 in cache_open (node=0xa3c85a0) at dcap.c:250
#14 0x00002b0d68345438 in dc_open (
    fname=0xa3c8438 "gsidcap://dcache-cms-gsidcap.desy.de:22128//pnfs/desy.de/cms/tier2/store/mc/JobRobot/RelValProdTTbar/GEN-SIM-DIGI-RECO/MC_3XY_V24_JobRobot-v1/0000//0EDC7961-822C-DF11-A0BE-001617E30E2C.root", flags=2048)
    at dcap_open.c:280
#15 0x00002b0d68345cc3 in dc_stage (
    path=0x735b442e1905f2e0 <Address 0x735b442e1905f2e0 out of bounds>, 
    atime=<value optimized out>, location=0xa40b4a0 "") at dcap_open.c:409
#16 0x00002b0d6832ba51 in DCacheStorageMaker::stagein(std::string const&, std::string const&) ()
   from /opt/vo/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_3_10_0_pre7io/lib/slc5_amd64_gcc434/pluginUtilitiesDCacheAdaptorPlugin.so
#17 0x00002b0d67879f25 in StorageFactory::stagein(std::string const&) ()
   from /opt/vo/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_3_10_0_pre7io/lib/slc5_amd64_gcc434/libUtilitiesStorageFactory.so
#18 0x00002b0d6819081c in edm::RootInputFileSequence::RootInputFileSequence(edm::ParameterSet const&, edm::PoolSource const&, edm::InputFileCatalog const&, edm::PrincipalCache&, bool) ()
   from /opt/vo/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_3_10_0_pre7io/lib/slc5_amd64_gcc434/pluginIOPoolInput.so
#19 0x00002b0d6815ff42 in edm::PoolSource::PoolSource(edm::ParameterSet const&, edm::InputSourceDescription const&) ()
   from /opt/vo/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_3_10_0_pre7io/lib/slc5_amd64_gcc434/pluginIOPoolInput.so
#20 0x00002b0d6815dc84 in edmplugin::PluginFactory<edm::InputSource* ()(edm::ParameterSet const&, edm::InputSourceDescription const&)>::PMaker<edm::PoolSource>::create(edm::ParameterSet const&, edm::InputSourceDescription const&) const ()
   from /opt/vo/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_3_10_0_pre7io/lib/slc5_amd64_gcc434/pluginIOPoolInput.so
#21 0x00002b0d643c8155 in edm::InputSourceFactory::makeInputSource(edm::ParameterSet const&, edm::InputSourceDescription const&) const ()
   from /opt/vo/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_3_10_0_pre7io/lib/slc5_amd64_gcc434/libFWCoreFramework.so
#22 0x00002b0d6436a4a5 in edm::makeInput(edm::ParameterSet&, edm::EventProcessor::CommonParams const&, edm::ProductRegistry&, edm::PrincipalCache&, boost::shared_ptr<edm::ActivityRegistry>, boost::shared_ptr<edm::ProcessConfiguration>) ()
   from /opt/vo/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_3_10_0_pre7io/lib/slc5_amd64_gcc434/libFWCoreFramework.so
#23 0x00002b0d6436dd13 in edm::EventProcessor::init(boost::shared_ptr<edm::ProcessDesc>&, edm::ServiceToken const&, edm::serviceregistry::ServiceLegacy) ()
   from /opt/vo/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_3_10_0_pre7io/lib/slc5_amd64_gcc434/libFWCoreFramework.so
#24 0x00002b0d643766f3 in edm::EventProcessor::EventProcessor(boost::shared_ptr<edm::ProcessDesc>&, edm::ServiceToken const&, edm::serviceregistry::ServiceLegacy) ()
   from /opt/vo/cms/slc5_amd64_gcc434/cms/cmssw/CMSSW_3_10_0_pre7io/lib/slc5_amd64_gcc434/libFWCoreFramework.so
#25 0x000000000040d833 in main ()
A debugging session is active.

        Inferior 1 [process 5979] will be detached.

-- LeonardoSala - 07-Dec-2010

Topic revision: r9 - 2010-12-26 - unknown
 