https://malariacontrol.net/workunit.php?wuid=61819797
Hi,
I have a WU that has given an error and has taken a long time.
Should I abort it and reset the project as mentioned in the error text?
I would like to know more about what causes the "no shared memory segment" error.
Does that error mean that I should abort the WU?
Or does it mean that BOINC did restart the WU?
Background data:
from BOINC Messages:
Fri 17 Feb 2012 01:49:15 AM EST Starting BOINC client version 6.10.58 for i686-pc-linux-gnu
Fri 17 Feb 2012 01:49:15 AM EST Processor: 1 GenuineIntel Intel(R) Celeron(R) M processor 1.70GHz [Family 6 Model 13 Stepping 8]
Fri 17 Feb 2012 01:49:15 AM EST Processor: 1.00 MB cache
Fri 17 Feb 2012 01:49:15 AM EST Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov clflush dts acpi mmx fxsr sse sse2 ss tm pbe nx up bts
Fri 17 Feb 2012 01:49:15 AM EST OS: Linux: 2.6.32-5-686
Fri 17 Feb 2012 01:49:15 AM EST Memory: 1.97 GB physical, 4.66 GB virtual
Fri 17 Feb 2012 01:49:15 AM EST Disk: 41.25 GB total, 33.98 GB free
[ skip thru time........]
Fri 02 Mar 2012 12:23:15 AM EST malariacontrol.net Starting wu_1203_426_152303_0_1330659171_1
Fri 02 Mar 2012 01:00:00 PM EST Suspending computation - time of day
Fri 02 Mar 2012 01:00:11 PM EST malariacontrol.net Task wu_1203_426_152303_0_1330659171_1: no shared memory segment
Fri 02 Mar 2012 01:00:11 PM EST malariacontrol.net Task wu_1203_426_152303_0_1330659171_1 exited with zero status but no 'finished' file
Fri 02 Mar 2012 01:00:11 PM EST malariacontrol.net If this happens repeatedly you may need to reset the project.
Fri 02 Mar 2012 02:00:00 PM EST Resuming computation
------
FROM WU Details
Workunit 61819797
name wu_1203_426_152303_0_1330659171
application openMalaria: A simulator of malaria epidemology and control (Branch B)
created 2 Mar 2012 3:32:51 UTC
Tasks in progress suppressed pending completion
---
Link to the task:
https://malariacontrol.net/result.php?resultid=115484707
At 23:21 EST, The BOINC status says:
Time elapsed: 21:57:50
Progress: 76%
To Completion: 06:31.34
I have BOINC set up to suspend all projects for 2 hours in the afternoon when an antivirus scan runs.
It looks like the error happened when BOINC tried to suspend.
All suggestions, advice, tomatoes are welcome.
(I'll to avoid the tomatoes...)
T H A N K Y O U !!
Jay E.
|