No heartbeat from core client for 31 sec - exiting

Anything "BOINC" specific can be commented on here...such as Project news and announcements etc. Also: any problems with BOINC or maybe you have found something interesting, tell us about it. Chat about the various 3rd party client applications used for some of the projects such as optimised clients.
Post Reply
UBT - PaulT
Posts: 128
Joined: Sun Apr 09, 2006 1:00 am

No heartbeat from core client for 31 sec - exiting

Post by UBT - PaulT »

Anyone got any ideas what "No heartbeat from core client for 31 sec - exiting" is all about?

It only happens when running SETI on my Linux based PC
UBT - Halifax-lad
Posts: 3790
Joined: Mon Mar 13, 2006 12:00 am

Post by UBT - Halifax-lad »

It is usually a timing issue inside BOINC with the various aspects of it running.

It can happen on windows machines when the clock on XP is updated to the correct time by the automated process windows performs, meaning BOINC gets out of step generally by a few seconds
UBT - Halifax-lad
Posts: 3790
Joined: Mon Mar 13, 2006 12:00 am

Post by UBT - Halifax-lad »

No heartbeat from core client - exiting
From Unofficial BOINC Wiki
Jump to: navigation, search
[edit]General
Message Type: Error Message

This error is caused by one or more of the running processes, the BOINC Daemon that make up the BOINC Client Software on the Participant's Computer has stopped running (it "crashed").

In other words, something real bad happened. The usual suspect is the BOINC Daemon, but it can also be from the failure of the Science Application.

Now, for those that want to know more, a "heart-beat" (or heartbeat) is a periodic message sent from one software component to another telling that other software component, "I am alive and well!". In the BOINC Client Software we have signals going from the Science Application to the BOINC Daemon, and a separate set of signals going in the other direction.

If the BOINC Daemon stops running, we want the Science Application to also stop, and vice versa. If one dies, the other should die also. These heart-beat signals are common in software systems where there are multiple components that run essentially independent of each other. They are just small messages and they are repeated every few minutes or so. So, they don't take much away from your hunt for maximum credit.

Courtesy of Walt Gribben (with minor edits by Paul):

This message means that the BOINC Client Software stopped communicating with the Science Application.

The BOINC Daemon sends a heartbeat message out so that the Science Application programs know its still alive and kicking. So if the messages stop, its supposed to mean that the BOINC Daemon isn't running anymore (perhaps it crashed?) and the Science Applications are also supposed to exit. Thats after they don't get a heartbeat message for 30 seconds. So, they print the "no heartbeat" error and exit. The Science Applications are using an exit code of zero to indicate there isn't any error, at least not with the Work Unit.

Later, the BOINC Daemon sees that the Science Application exited (zero status) but wasn't finished with the Work Unit (there is no finished file) so it restarts the Work Unit. And from where it left off, or at least from the last Checkpoint.

There might be a problem with BOINC Daemon to the Science Application communications, but its not all that serious. Some time is lost in restarting the Work Unit, but its not like it has to start from the beginning each time. After I saw the Work Units were completing in around the same time whether or not they got "no heartbeat" messages, I stopped looking into it.
UBT - PaulT
Posts: 128
Joined: Sun Apr 09, 2006 1:00 am

Post by UBT - PaulT »

Thanks.  I'll try reinstalling and see what happens
http://ast.cable.nu
Posts: 6
Joined: Tue May 23, 2006 1:00 am

Post by http://ast.cable.nu »

How can I change the Science Applications listen Boinc's heartbeat message time from 30 seconds to 5 seconds? thx!
Temujin
Posts: 2259
Joined: Mon Mar 13, 2006 12:00 am

Post by Temujin »

http://ast.cable.nu wrote:How can I change the Science Applications listen Boinc's heartbeat message time from 30 seconds to 5 seconds? thx!
I'm not sure you can change it, its more an internal boinc->app communication thing.

Welcome to the forum BTW, how's things in Taiwan?
UBT - Timbo
UBT Forum Admin
Posts: 9680
Joined: Mon Mar 13, 2006 12:00 am
Location: NW Midlands
Contact:

Post by UBT - Timbo »

http://ast.cable.nu wrote:How can I change the Science Applications listen Boinc's heartbeat message time from 30 seconds to 5 seconds? thx!
As far as I know there is a "built-in" 30 second period after which the BOINC client check to see that the application is till running OK. If so, I doubt it can be changed by the user - why do you want to check every 5 seconds - isn't 30 secnods OK?


Also: Suggest you visit the BOINC site here:

http://boinc.berkeley.edu/dev/ and maybe read the messages in the forum there - or even post a message.

regards,

Tim
UBT - PaulT
Posts: 128
Joined: Sun Apr 09, 2006 1:00 am

Post by UBT - PaulT »

http://ast.cable.nu wrote:How can I change the Science Applications listen Boinc's heartbeat message time from 30 seconds to 5 seconds? thx!
Why would you want to?  It would take valuable CPU time away from crunching the work.

Please change you user name so that it's not a URL, it plays havoc with the Linkification Extension in Firefox.  If you want to advertise your web site, put a link in your sig.
UBT - Timbo
UBT Forum Admin
Posts: 9680
Joined: Mon Mar 13, 2006 12:00 am
Location: NW Midlands
Contact:

Post by UBT - Timbo »

UBT - PaulT wrote:
http://ast.cable.nu wrote:How can I change the Science Applications listen Boinc's heartbeat message time from 30 seconds to 5 seconds? thx!
Why would you want to?  It would take valuable CPU time away from crunching the work.

Please change you user name so that it's not a URL, it plays havoc with the Linkification Extension in Firefox.  If you want to advertise your web site, put a link in your sig.
Paul,

His username on the forum is actually his username on the BOINC project (where's he's at #2 in his team with over 2.7 million credits):

http://www.boincstats.com/stats/boinc_u ... amid=27470


Wonder if he would like to join OUR team ????

regards,

Tim
Temujin
Posts: 2259
Joined: Mon Mar 13, 2006 12:00 am

Post by Temujin »

any idea how we've managed to attract someone from Taiwan?
apart from the obvious of we're the best team in the whole wide world  :D
UBT - PaulT
Posts: 128
Joined: Sun Apr 09, 2006 1:00 am

Post by UBT - PaulT »

UBT - Timbo wrote: Paul,

His username on the forum is actually his username on the BOINC project (where's he's at #2 in his team with over 2.7 million credits):

http://www.boincstats.com/stats/boinc_u ... amid=27470


Wonder if he would like to join OUR team ????

regards,

Tim
don't think that he will,  I looked down that list you posted. Not one of them in a team.
Temujin wrote:any idea how we've managed to attract someone from Taiwan?
apart from the obvious of we're the best team in the whole wide world  :D
Most probably just did a search on the "no heartbeat" issue, found our forum and posted his question here.
http://ast.cable.nu
Posts: 6
Joined: Tue May 23, 2006 1:00 am

Post by http://ast.cable.nu »

UBT - Timbo wrote:
http://ast.cable.nu wrote:How can I change the Science Applications listen Boinc's heartbeat message time from 30 seconds to 5 seconds? thx!
As far as I know there is a "built-in" 30 second period after which the BOINC client check to see that the application is till running OK. If so, I doubt it can be changed by the user - why do you want to check every 5 seconds - isn't 30 secnods OK?


Also: Suggest you visit the BOINC site here:

http://boinc.berkeley.edu/dev/ and maybe read the messages in the forum there - or even post a message.

regards,

Tim
I run my boinc client by windows schedule(*.job). when the task is stop, the SETI application still run about 30 seconds. I hope the period become shorten.
http://ast.cable.nu
Posts: 6
Joined: Tue May 23, 2006 1:00 am

Post by http://ast.cable.nu »

UBT - PaulT wrote:
http://ast.cable.nu wrote:How can I change the Science Applications listen Boinc's heartbeat message time from 30 seconds to 5 seconds? thx!
Why would you want to?  It would take valuable CPU time away from crunching the work.

Please change you user name so that it's not a URL, it plays havoc with the Linkification Extension in Firefox.  If you want to advertise your web site, put a link in your sig.
i'm sorry. Can i change my user name?
UBT - Halifax-lad
Posts: 3790
Joined: Mon Mar 13, 2006 12:00 am

Post by UBT - Halifax-lad »

You don't have to change it if you want to stick with that username
UBT - Timbo
UBT Forum Admin
Posts: 9680
Joined: Mon Mar 13, 2006 12:00 am
Location: NW Midlands
Contact:

Post by UBT - Timbo »

http://ast.cable.nu wrote:I run my boinc client by windows schedule(*.job). when the task is stop, the SETI application still run about 30 seconds. I hope the period become shorten.
OK - so it's the way in which the SETI application shuts down that's the problem.

As far as I know, each application must do some "tidying up" in order to allow BOINC to sync with the next project application.

But I'm not a BOINC developer, so maybe you need to talk to Rom Walton at SETI - I'll PM you his email address....and hope it helps.

regards,

Tim
http://ast.cable.nu
Posts: 6
Joined: Tue May 23, 2006 1:00 am

Post by http://ast.cable.nu »

UBT - Timbo wrote:
UBT - PaulT wrote:
http://ast.cable.nu wrote:How can I change the Science Applications listen Boinc's heartbeat message time from 30 seconds to 5 seconds? thx!
Why would you want to?  It would take valuable CPU time away from crunching the work.

Please change you user name so that it's not a URL, it plays havoc with the Linkification Extension in Firefox.  If you want to advertise your web site, put a link in your sig.
Paul,

His username on the forum is actually his username on the BOINC project (where's he's at #2 in his team with over 2.7 million credits):

http://www.boincstats.com/stats/boinc_u ... amid=27470


Wonder if he would like to join OUR team ????

regards,

Tim
Tim,
You get it. I'm #2 in my team. also #2 in my country.(S@H)
http://www.boincsynergy.com/stats/count ... roject=sah
UBT - Timbo
UBT Forum Admin
Posts: 9680
Joined: Mon Mar 13, 2006 12:00 am
Location: NW Midlands
Contact:

Post by UBT - Timbo »

http://ast.cable.nu wrote:Tim,
You get it. I'm #2 in my team. also #2 in my country.(S@H)
http://www.boincsynergy.com/stats/count ... roject=sah

Ah, Thanks - Congratulations of your high ranking position - well done!

So, what's your real name? I looked at your website  - is it "Bill"?

Hope you are OK after the recent storm in Far East - I have family in HK and they had big storm warnings just before storm headed NE towards Taiwan . everything OK there now?

Also, must mention - MAYBE you might join our team? - you can see private area of forum plus also talk in "real time" on our chatroom - maybe that would be good - as most members of UBT are UK based - so you could be our 1st Far East member ???

regards,

Tim
http://ast.cable.nu
Posts: 6
Joined: Tue May 23, 2006 1:00 am

Post by http://ast.cable.nu »

UBT - PaulT wrote:
UBT - Timbo wrote: Paul,

His username on the forum is actually his username on the BOINC project (where's he's at #2 in his team with over 2.7 million credits):

http://www.boincstats.com/stats/boinc_u ... amid=27470


Wonder if he would like to join OUR team ????

regards,

Tim
don't think that he will,  I looked down that list you posted. Not one of them in a team.
Temujin wrote:any idea how we've managed to attract someone from Taiwan?
apart from the obvious of we're the best team in the whole wide world  :D
Most probably just did a search on the "no heartbeat" issue, found our forum and posted his question here.
:D I search by google :D
http://ast.cable.nu
Posts: 6
Joined: Tue May 23, 2006 1:00 am

Post by http://ast.cable.nu »

UBT - Timbo wrote:
http://ast.cable.nu wrote:Tim,
You get it. I'm #2 in my team. also #2 in my country.(S@H)
http://www.boincsynergy.com/stats/count ... roject=sah

Ah, Thanks - Congratulations of your high ranking position - well done!

So, what's your real name? I looked at your website  - is it "Bill"?

Hope you are OK after the recent storm in Far East - I have family in HK and they had big storm warnings just before storm headed NE towards Taiwan . everything OK there now?

Also, must mention - MAYBE you might join our team? - you can see private area of forum plus also talk in "real time" on our chatroom - maybe that would be good - as most members of UBT are UK based - so you could be our 1st Far East member ???

regards,

Tim
I'm OK,thx. The big storm didn't attack Taiwan lastly.
My name is Bill. I really appreciate your invitation. Although i like your team, i will stay my team.
My english is poor. inaptitude for using the chatroom. :oops:
UBT - BHCJackie
Posts: 315
Joined: Mon Mar 13, 2006 12:00 am

Post by UBT - BHCJackie »

Bill - please don't let your English stop you from joining us in the chatroom.  Your English is probably better than my attempts at any non-English language.
UBT - Timbo
UBT Forum Admin
Posts: 9680
Joined: Mon Mar 13, 2006 12:00 am
Location: NW Midlands
Contact:

Post by UBT - Timbo »

http://ast.cable.nu wrote:I'm OK,thx. The big storm didn't attack Taiwan lastly. My name is Bill. I really appreciate your invitation. Although i like your team, i will stay my team.
My english is poor. inaptitude for using the chatroom. :oops:

No worries - good luck with your Taiwan Team - they are doing very well and maybe they catch up with us sometime soon !.

It is good to hear from you anyways - please feel free to come back even if your English is not so good - you can practice with us !!

I am sure NONE of us can speak your language....!

regards,

Tim
Post Reply