client failure detection in client-server systems (distributed)

2023-01-24 06:36 问答作者：

Assume a distributed communication system where client and server communicate via a stateless channel.

The client sends requests to the server and the server does processing and keeps internal records for each client.

Server sends back notifications to the clients as various events happen to the system, as needed.

The notification mechanism depends on the internal records.

My question is, what is the standard appoach in distributed computing to handle the client failures?

I.e. in this context, assume that the client process crashes or simply restarts. The server still has the records for the client but now client and server are of sync. As a result client will get notifications according to records created before restart. This is undesirable.

What is a standardized way to detect the client failures? E.g. client has restarted and previous records must be erased?

I thought of periodic开发者_JAVA百科 callbacks to clients and if a client is not reachable, erase its records but I am not sure if this is a good idea. [EDIT] I thought of callbacks because, the period events send back to the client can be in very large intervals and so the client failure would not be noticable soon

Can anyone help on this? The context of my application domain is web services.

Thank you!

The standard approach varies from system to system depending to the architecture and domain. How the server finds out that the client is down? I think you don't need callbacks, since you send the notifications and can detect that the client is unreachable. For example:

send a notification to the client;
if success, goto 1;
else erase all the notifications in the queue for the client, set a flag to not collect events for the client.

When a client is connected:

unset the flag;
start sending notifications

Or even a simpler approach:

erase the notification queue for the client when it connects before initializing the conversation;
run a low-priority thread to erase all the notifications for all the clients which are older then X, to clean notifications for the client which will never come back.

Update after the original author comments

It strongly depends on how things are organized in your system. Assuming:

The server starts a thread (let's call it "agent") to serve a client, a thread per client.
The agent exits when the clients shuts down the session properly or goes down.
there is a private (which is not shared among agents/clients) record set for each client
there is a shared list of current clients which is used by another component (not an ordinary agent, let's call it "dispatcher") to distribute records for clients.

solution: 1. the server starts an agent and registers the client just connected to list of clients. The dispatcher gets notified that a new client arrived. 2. the agent consumes the records until client is connected. On client's shutdown and/or failure the agents unregisters the client and cleans the record set.

If things in your system aren't organized in the way described above, please provide some details.

继续阅读：client-server distributed web-applications

client failure detection in client-server systems (distributed)

Update after the original author comments

更多精彩内容

精彩评论

最新问答

绝区零和崩坏星穹铁道谁更吃配置?？

电视机蓝屏,边缘处带红是哪里的毛病？

双侧输卵管远端积水怎么治疗？

关于学生看电视有什么好处与坏处?？

永劫无间手游崔三娘魂玉怎么选择?？

问答排行榜

Escaping "<" in Perl-generated XML

微信重新建群怎么建？

imessage会显示已读吗？

太快了能不能慢一点好爽~好大~不要拔出来了？

二年级家长回音怎么写大全简短的（二年级家长回音怎么写）？

Update after the original author comments

更多精彩内容

精彩评论

最新问答

绝区零和崩坏星穹铁道谁更吃配置?？

电视机蓝屏,边缘处带红是哪里的毛病？

双侧输卵管远端积水怎么治疗？

关于学生看电视有什么好处与坏处?？

永劫无间手游崔三娘魂玉怎么选择?？

问答排行榜

Escaping "<" in Perl-generated XML

微信重新建群怎么建？

imessage会显示已读吗？

太快了能不能慢一点 好爽~好大~不要拔出来了？

二年级家长回音怎么写大全简短的（二年级家长回音怎么写）？

太快了能不能慢一点好爽~好大~不要拔出来了？