Choice of transports for JSON over TCP
I'm writing a simple streaming JSON service. It consists of JSON messages, sent intermittently, for a long period of time (weeks or months).
What is the best practise with regard to sending multiple JSON messages over a plain TCP socket?
Some alternatives I have looked at (and their downsides) are:
- newline separated JSON - downside: newlines within JSON require escaping, or prohibition
- websocket inspired 0x00 0xff framing - downside: it's now binary, not utf-8 anymore
- real websockets - downside: lack of (opensource) websocket client libraries
- http multipart http://www.w3.org/Protocols/rfc1341/7_2_Multipart.html - downside: incomplete client support?
- no delimiters - downside: chunking requires JSON parsing 开发者_如何转开发(can't just count curlies because of curlies in strings)
Is there a good, or at least well-established way of doing this?
my first two options would be:
Do what early TCP protocols do: send one message (a JSON object in your case) and close the connection. The client detects it and reopens to get the next object.
- pros: very easy to parse, no extra (content) bytes sent. any loss of data means losing just a single object. if you can stand that, there's no need to add retransmision to your app.
- cons: if you send a (huge) lot of (very) small objects, the three-packet TCP handshake adds to latency.
Do what chunked-mode HTTP does: first send the number of bytes in the JSON object, a newline (CRLF in HTTP), and your JSON object. The client just have to count bytes to know when the next byte would be the next objectsize.
- pros: you keep one long-lived stream.
- cons: a few extra bytes, you have to keep a long-lived stream, so accidental break and reconnection has to be handled as exceptional events, need to establish some handshaking to continue where it failed.
When you want to serve browser clients, the closest you get to raw TCP is WebSockets.
WebSockets has sufficient momentum that browser vendors will improve support (Chrome 14 and Firefox 7/8 support the latest protocol draft) and that a broad range of client and server frameworks will support it.
There are already a couple of open-source client libraries, including Autobahn WebSocket.
When you want to bake something for your own (on top of raw TCP), I would recommend a length-prefixed format for your JSON messages, i.e. Netstrings
Disclaimer: I am author of Autobahn and work for Tavendo.
I've codified what I and some other developers are doing:
http://en.wikipedia.org/wiki/Line_Delimited_JSON
It has the advantage of being netcat/telnet compatible.
See also: http://ndjson.org/
The first of four bytes of the message can be an 32-bit integer indicating size (in bytes) of the message. Then the receiver should follow these steps:
- Read the first four bytes of data and figure out the exact amount of bytes you need to read the whole message.
- Read the rest of the message and deserialize it as a JSON
Sender code in C#:
public void WriteMessage(Packet packet) {
// Convert the object to JSON
byte[] message = Encoding.UTF8.GetBytes(packet.Serialize());
// Serialize the number of characters
byte[] messageLength = BitConverter.GetBytes(message.Length);
// Build the full message that will hold both the size of the message and the message itself
byte[] buffer = new byte[sizeof(int) + message.Length];
Array.Clear(message, 0, message.Length);
// Print the size into the buffer
for (int i = 0; i < sizeof(int); i++)
{
buffer[i] = messageLength[i];
}
// Print the message into the buffer
for (int i = 0; i < message.Length; i++)
{
buffer[i + sizeof(int)] = message[i];
}
// Send it
stream.Write(buffer, 0, buffer.Length);
}
You can use Server-Sent Events.
var source = new EventSource('/EventSource');
source.onmessage = function(e) {
var data = JSON.parse(e.data);
console.log(e.data);
};
source.onopen = function(e) {
console.log('EventSource opened');
};
source.onerror = function(e) {
console.log('EventSource error');
};
精彩评论