Author |
Message |
TylerChris
Send message
Joined: Mar 29 07
Posts: 23
Credit: 513,393
RAC: 2
|
Seeing a large rise in this issue on 'B'WUs only.
On my machine it only occurs on the longer work units.
eg.
explain
Status
Run time
(sec)
CPU time
(sec)
Credit
Application
113069493
442248
8 Feb 2012 4:53:10 UTC
8 Feb 2012 16:44:18 UTC
Completed, marked as invalid
26,483.56
26,443.82
0.00
openMalaria: A simulator of malaria epidemology and control (Branch B) v6.57
113075412
152827
8 Feb 2012 5:06:19 UTC
8 Feb 2012 21:39:50 UTC
Completed, marked as invalid
24,596.31
22,597.09
0.00
openMalaria: A simulator of malaria epidemology and control (Branch B) v6.57
113144426
406355
8 Feb 2012 21:52:19 UTC
12 Feb 2012 12:31:08 UTC
Completed and validated
20,686.63
20,440.56
148.91
openMalaria: A simulator of malaria epidemology and control (Branch B) v6.57
113496957
439960
12 Feb 2012 9:16:16 UTC
13 Feb 2012 0:52:09 UTC
Completed, marked as invalid
15,746.01
15,746.01
0.00
openMalaria: A simulator of malaria epidemology and control (Branch B) v6.57
113564291
203599
13 Feb 2012 1:08:37 UTC
13 Feb 2012 12:44:18 UTC
Completed and validated
19,291.73
17,691.00
148.91
openMalaria: A simulator of malaria epidemology and control (Branch B) v6.57
.
Task is Here
Cannot find anything wrong in the logs.
Thanks
Chris
|
|
ukjohnd
Send message
Joined: Jan 14 07
Posts: 2
Credit: 642,105
RAC: 0
|
Same here.
This one invalid
https://malariacontrol.net/workunit.php?wuid=60472250
This one valid
https://malariacontrol.net/workunit.php?wuid=60746137
Plus quite a lot of others
|
|
P . P . L .

Send message
Joined: Aug 27 08
Posts: 56
Credit: 500,976
RAC: 0
|
Hi.
Add me to this group of errors, they are only on these long tasks run on (Branch B) v6.57 app
i don't have any trouble with v6.58 or other projects that have longer tasks!
Some are taking up to 12hrs to finish, what a waste.
https://malariacontrol.net/workunit.php?wuid=61821326
Name wu_1203_417_152316_0_1330661466_1
Workunit 61821326
Created 2 Mar 2012 5:48:35 UTC
Sent 2 Mar 2012 5:58:24 UTC
Received 3 Mar 2012 6:50:31 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x0)
Computer ID 176059
Report deadline 5 Mar 2012 17:18:24 UTC
Run time 48,907.79
CPU time 48,304.58
Validate state Invalid
Credit 0.00
Application version openMalaria: A simulator of malaria epidemology and control (Branch B) v6.57
=======================================================================
https://malariacontrol.net/workunit.php?wuid=61670804
Name wu_1204_416_151196_0_1330474869_1
Workunit 61670804
Created 29 Feb 2012 1:39:46 UTC
Sent 29 Feb 2012 2:03:54 UTC
Received 29 Feb 2012 23:15:26 UTC
Server state Over
Outcome Success
Client state Done
Exit status 0 (0x0)
Computer ID 176059
Report deadline 3 Mar 2012 13:23:54 UTC
Run time 41,119.67
CPU time 40,605.60
Validate state Invalid
Credit 0.00
Application version openMalaria: A simulator of malaria epidemology and control (Branch B) v6.57
==========================================================
I have/had others like this one marked as inconclusive, i don't what your problem is with these.
https://malariacontrol.net/workunit.php?wuid=61737301
Completed, validation inconclusive 47,069.73 46,397.23 pending openMalaria: A simulator of malaria epidemology and control (Branch B) v6.57
This one will more than likely go the same way, i've now switched off all (Branch B) v6.57 on my rigs over this.
https://malariacontrol.net/workunit.php?wuid=61795996
Completed, waiting for validation 21,968.61 21,068.62 pending openMalaria: A simulator of malaria epidemology and control (Branch B) v6.57
____________

|
|
mikey

Send message
Joined: Mar 23 07
Posts: 4382
Credit: 5,361,193
RAC: 1,084
|
Are you guys crunching multiple projects meaning Malaria has to pause while another project runs and then start back up again? The reason I ask is I run Malaria on a pc that ONLY runs Malaria and I am having NO problems at all. Knock on wood!!
|
|
Ananas
Send message
Joined: Mar 7 06
Posts: 58
Credit: 752,054
RAC: 408
|
Unsuspended result (openMalariaB v6.57, it ran uninterrupted), invalid, I guess it's a Linux vs. Windows issue :
wu_1190_506_177621_0_1337957189
Linux :
Warning: will use heterogeneity workaround.
sim end
T/A: 1406908/1406908 <======================
20:30:10 (27468): called boinc_finish
Windows :
Warning: will use heterogeneity workaround.
sim end
T/A: 1403145/1403145 <======================
22:15:50 (1792): called boinc_finish
The box does not tend to have invalid/inconclusive results, usually it has only trouble with the checkpoints (BOINC heartbeat bug) when several Malaria results checkpoint simuntanously
|
|
swiftmallard

Send message
Joined: Jul 24 09
Posts: 651
Credit: 1,130,259
RAC: 0
|
Here is one that is marked as successful but no credit is granted:
https://malariacontrol.net/result.php?resultid=125933327
Neither I nor my wingman received credit.
|
|
Ananas
Send message
Joined: Mar 7 06
Posts: 58
Credit: 752,054
RAC: 408
|
That's the default reward for crunching long running workunits :-(
|
|
swiftmallard

Send message
Joined: Jul 24 09
Posts: 651
Credit: 1,130,259
RAC: 0
|
I completed several others almost as long and received credit.
https://malariacontrol.net/result.php?resultid=126025075
https://malariacontrol.net/result.php?resultid=126025076
https://malariacontrol.net/result.php?resultid=126025074
|
|
Ananas
Send message
Joined: Mar 7 06
Posts: 58
Credit: 752,054
RAC: 408
|
probably just below the limit. Check the other threads lately, others have the problem too ("Unusual result", "Errors Overnight" and "Long Run Times")
|
|
swiftmallard

Send message
Joined: Jul 24 09
Posts: 651
Credit: 1,130,259
RAC: 0
|
What limit?
|
|
Ananas
Send message
Joined: Mar 7 06
Posts: 58
Credit: 752,054
RAC: 408
|
Claimed credits or used-up CPU operations (those values are directly related). Those are a multiplication of runtime, benchmark results and a factor.
Above that limit (I guess has a fixed ratio to some average value), BOINC doesn't grant any credits, below it does acording to the project rules.
|
|
michaelT
Volunteer moderator
Project administrator
Project developer
Project scientist Send message
Joined: Jul 20 10
Posts: 47
Credit: 16,359
RAC: 0
|
Regarding the non credit issue the problem :
It seems that this is due to some issues with the validator : there is a MAX_GRANTED_CREDIT parameter which should in theory grant MAX_GRANTED_CREDIT (it avoids cheating with high credit request) if WU_CREDIT > MAX_GRANTED_CREDIT but in our case it granted 0 credit ... :(
Some of the 0 granted workunits have already been purged but we manage to get all the hosts and the average credits for all those ones. So for the one who didn't get credit before it's fixed now.
We increased the MAX_GRANTED_CREDIT like that this should not be a problem anymore. But let me know if it happen again.
|
|
Post to thread