No heartbeat from core client for 31 sec - exiting
-
- Posts: 128
- Joined: Sun Apr 09, 2006 1:00 am
No heartbeat from core client for 31 sec - exiting
Anyone got any ideas what "No heartbeat from core client for 31 sec - exiting" is all about?
It only happens when running SETI on my Linux based PC
It only happens when running SETI on my Linux based PC
-
- Posts: 3790
- Joined: Mon Mar 13, 2006 12:00 am
-
- Posts: 3790
- Joined: Mon Mar 13, 2006 12:00 am
No heartbeat from core client - exiting
From Unofficial BOINC Wiki
Jump to: navigation, search
[edit]General
Message Type: Error Message
This error is caused by one or more of the running processes, the BOINC Daemon that make up the BOINC Client Software on the Participant's Computer has stopped running (it "crashed").
In other words, something real bad happened. The usual suspect is the BOINC Daemon, but it can also be from the failure of the Science Application.
Now, for those that want to know more, a "heart-beat" (or heartbeat) is a periodic message sent from one software component to another telling that other software component, "I am alive and well!". In the BOINC Client Software we have signals going from the Science Application to the BOINC Daemon, and a separate set of signals going in the other direction.
If the BOINC Daemon stops running, we want the Science Application to also stop, and vice versa. If one dies, the other should die also. These heart-beat signals are common in software systems where there are multiple components that run essentially independent of each other. They are just small messages and they are repeated every few minutes or so. So, they don't take much away from your hunt for maximum credit.
Courtesy of Walt Gribben (with minor edits by Paul):
This message means that the BOINC Client Software stopped communicating with the Science Application.
The BOINC Daemon sends a heartbeat message out so that the Science Application programs know its still alive and kicking. So if the messages stop, its supposed to mean that the BOINC Daemon isn't running anymore (perhaps it crashed?) and the Science Applications are also supposed to exit. Thats after they don't get a heartbeat message for 30 seconds. So, they print the "no heartbeat" error and exit. The Science Applications are using an exit code of zero to indicate there isn't any error, at least not with the Work Unit.
Later, the BOINC Daemon sees that the Science Application exited (zero status) but wasn't finished with the Work Unit (there is no finished file) so it restarts the Work Unit. And from where it left off, or at least from the last Checkpoint.
There might be a problem with BOINC Daemon to the Science Application communications, but its not all that serious. Some time is lost in restarting the Work Unit, but its not like it has to start from the beginning each time. After I saw the Work Units were completing in around the same time whether or not they got "no heartbeat" messages, I stopped looking into it.
-
- Posts: 128
- Joined: Sun Apr 09, 2006 1:00 am
-
- Posts: 6
- Joined: Tue May 23, 2006 1:00 am
I'm not sure you can change it, its more an internal boinc->app communication thing.http://ast.cable.nu wrote:How can I change the Science Applications listen Boinc's heartbeat message time from 30 seconds to 5 seconds? thx!
Welcome to the forum BTW, how's things in Taiwan?
-
- UBT Forum Admin
- Posts: 9710
- Joined: Mon Mar 13, 2006 12:00 am
- Location: NW Midlands
- Contact:
As far as I know there is a "built-in" 30 second period after which the BOINC client check to see that the application is till running OK. If so, I doubt it can be changed by the user - why do you want to check every 5 seconds - isn't 30 secnods OK?http://ast.cable.nu wrote:How can I change the Science Applications listen Boinc's heartbeat message time from 30 seconds to 5 seconds? thx!
Also: Suggest you visit the BOINC site here:
http://boinc.berkeley.edu/dev/ and maybe read the messages in the forum there - or even post a message.
regards,
Tim
-
- Posts: 128
- Joined: Sun Apr 09, 2006 1:00 am
Why would you want to? It would take valuable CPU time away from crunching the work.http://ast.cable.nu wrote:How can I change the Science Applications listen Boinc's heartbeat message time from 30 seconds to 5 seconds? thx!
Please change you user name so that it's not a URL, it plays havoc with the Linkification Extension in Firefox. If you want to advertise your web site, put a link in your sig.
-
- UBT Forum Admin
- Posts: 9710
- Joined: Mon Mar 13, 2006 12:00 am
- Location: NW Midlands
- Contact:
Paul,UBT - PaulT wrote:Why would you want to? It would take valuable CPU time away from crunching the work.http://ast.cable.nu wrote:How can I change the Science Applications listen Boinc's heartbeat message time from 30 seconds to 5 seconds? thx!
Please change you user name so that it's not a URL, it plays havoc with the Linkification Extension in Firefox. If you want to advertise your web site, put a link in your sig.
His username on the forum is actually his username on the BOINC project (where's he's at #2 in his team with over 2.7 million credits):
http://www.boincstats.com/stats/boinc_u ... amid=27470
Wonder if he would like to join OUR team ????
regards,
Tim
-
- Posts: 128
- Joined: Sun Apr 09, 2006 1:00 am
don't think that he will, I looked down that list you posted. Not one of them in a team.UBT - Timbo wrote: Paul,
His username on the forum is actually his username on the BOINC project (where's he's at #2 in his team with over 2.7 million credits):
http://www.boincstats.com/stats/boinc_u ... amid=27470
Wonder if he would like to join OUR team ????
regards,
Tim
Most probably just did a search on the "no heartbeat" issue, found our forum and posted his question here.Temujin wrote:any idea how we've managed to attract someone from Taiwan?
apart from the obvious of we're the best team in the whole wide world :D
-
- Posts: 6
- Joined: Tue May 23, 2006 1:00 am
I run my boinc client by windows schedule(*.job). when the task is stop, the SETI application still run about 30 seconds. I hope the period become shorten.UBT - Timbo wrote:As far as I know there is a "built-in" 30 second period after which the BOINC client check to see that the application is till running OK. If so, I doubt it can be changed by the user - why do you want to check every 5 seconds - isn't 30 secnods OK?http://ast.cable.nu wrote:How can I change the Science Applications listen Boinc's heartbeat message time from 30 seconds to 5 seconds? thx!
Also: Suggest you visit the BOINC site here:
http://boinc.berkeley.edu/dev/ and maybe read the messages in the forum there - or even post a message.
regards,
Tim
-
- Posts: 6
- Joined: Tue May 23, 2006 1:00 am
i'm sorry. Can i change my user name?UBT - PaulT wrote:Why would you want to? It would take valuable CPU time away from crunching the work.http://ast.cable.nu wrote:How can I change the Science Applications listen Boinc's heartbeat message time from 30 seconds to 5 seconds? thx!
Please change you user name so that it's not a URL, it plays havoc with the Linkification Extension in Firefox. If you want to advertise your web site, put a link in your sig.
-
- Posts: 3790
- Joined: Mon Mar 13, 2006 12:00 am
-
- UBT Forum Admin
- Posts: 9710
- Joined: Mon Mar 13, 2006 12:00 am
- Location: NW Midlands
- Contact:
OK - so it's the way in which the SETI application shuts down that's the problem.http://ast.cable.nu wrote:I run my boinc client by windows schedule(*.job). when the task is stop, the SETI application still run about 30 seconds. I hope the period become shorten.
As far as I know, each application must do some "tidying up" in order to allow BOINC to sync with the next project application.
But I'm not a BOINC developer, so maybe you need to talk to Rom Walton at SETI - I'll PM you his email address....and hope it helps.
regards,
Tim
-
- Posts: 6
- Joined: Tue May 23, 2006 1:00 am
Tim,UBT - Timbo wrote:Paul,UBT - PaulT wrote:Why would you want to? It would take valuable CPU time away from crunching the work.http://ast.cable.nu wrote:How can I change the Science Applications listen Boinc's heartbeat message time from 30 seconds to 5 seconds? thx!
Please change you user name so that it's not a URL, it plays havoc with the Linkification Extension in Firefox. If you want to advertise your web site, put a link in your sig.
His username on the forum is actually his username on the BOINC project (where's he's at #2 in his team with over 2.7 million credits):
http://www.boincstats.com/stats/boinc_u ... amid=27470
Wonder if he would like to join OUR team ????
regards,
Tim
You get it. I'm #2 in my team. also #2 in my country.(S@H)
http://www.boincsynergy.com/stats/count ... roject=sah
-
- UBT Forum Admin
- Posts: 9710
- Joined: Mon Mar 13, 2006 12:00 am
- Location: NW Midlands
- Contact:
http://ast.cable.nu wrote:Tim,
You get it. I'm #2 in my team. also #2 in my country.(S@H)
http://www.boincsynergy.com/stats/count ... roject=sah
Ah, Thanks - Congratulations of your high ranking position - well done!
So, what's your real name? I looked at your website - is it "Bill"?
Hope you are OK after the recent storm in Far East - I have family in HK and they had big storm warnings just before storm headed NE towards Taiwan . everything OK there now?
Also, must mention - MAYBE you might join our team? - you can see private area of forum plus also talk in "real time" on our chatroom - maybe that would be good - as most members of UBT are UK based - so you could be our 1st Far East member ???
regards,
Tim
-
- Posts: 6
- Joined: Tue May 23, 2006 1:00 am
I search by googleUBT - PaulT wrote:don't think that he will, I looked down that list you posted. Not one of them in a team.UBT - Timbo wrote: Paul,
His username on the forum is actually his username on the BOINC project (where's he's at #2 in his team with over 2.7 million credits):
http://www.boincstats.com/stats/boinc_u ... amid=27470
Wonder if he would like to join OUR team ????
regards,
Tim
Most probably just did a search on the "no heartbeat" issue, found our forum and posted his question here.Temujin wrote:any idea how we've managed to attract someone from Taiwan?
apart from the obvious of we're the best team in the whole wide world :D
-
- Posts: 6
- Joined: Tue May 23, 2006 1:00 am
I'm OK,thx. The big storm didn't attack Taiwan lastly.UBT - Timbo wrote:http://ast.cable.nu wrote:Tim,
You get it. I'm #2 in my team. also #2 in my country.(S@H)
http://www.boincsynergy.com/stats/count ... roject=sah
Ah, Thanks - Congratulations of your high ranking position - well done!
So, what's your real name? I looked at your website - is it "Bill"?
Hope you are OK after the recent storm in Far East - I have family in HK and they had big storm warnings just before storm headed NE towards Taiwan . everything OK there now?
Also, must mention - MAYBE you might join our team? - you can see private area of forum plus also talk in "real time" on our chatroom - maybe that would be good - as most members of UBT are UK based - so you could be our 1st Far East member ???
regards,
Tim
My name is Bill. I really appreciate your invitation. Although i like your team, i will stay my team.
My english is poor. inaptitude for using the chatroom.
-
- Posts: 315
- Joined: Mon Mar 13, 2006 12:00 am
-
- UBT Forum Admin
- Posts: 9710
- Joined: Mon Mar 13, 2006 12:00 am
- Location: NW Midlands
- Contact:
http://ast.cable.nu wrote:I'm OK,thx. The big storm didn't attack Taiwan lastly. My name is Bill. I really appreciate your invitation. Although i like your team, i will stay my team.
My english is poor. inaptitude for using the chatroom.
No worries - good luck with your Taiwan Team - they are doing very well and maybe they catch up with us sometime soon !.
It is good to hear from you anyways - please feel free to come back even if your English is not so good - you can practice with us !!
I am sure NONE of us can speak your language....!
regards,
Tim