String encoding of a Scandinavian letter from URL to UTF-8 to const char on iPhone

NSString *theString = @"a %C3%B8 b";

NSLog(@"%@", theString);

NSString *utf8string = [theString stringByReplacingPercentEscapesUsingEncoding:NSUTF8StringEncoding];

NSLog(@"%@", utf8string);

const char *theChar = [utf8string UTF8String];

NSLog(@"%s", theChar);

This logs the following:

'a %C3%B8 b'

'a ø b'

'a √∏ b'

The problem is that I want theChar to be 'a ø b'. Any help on how to achieve that would be greatly appreciated.


I don't think you can. char is an eight-bit type, so every value is in the range 0-255. In UTF-8, ø does not fit in a single char: it is encoded as the two bytes 0xC3 0xB8.

You might want to look at the unichar type, which is a 16-bit type. It can hold ø as a single item, and you can use getCharacters:range: to get the characters out of the NSString.
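To illustrate why one 16-bit unit is enough for ø, here is a minimal sketch in plain C (it also compiles as Objective-C). The function name decode_utf8_two_byte is hypothetical and not part of any Apple API. It handles only the two-byte UTF-8 form and does not reject overlong encodings.

```c
#include <stdint.h>

/* Decode a two-byte UTF-8 sequence (110xxxxx 10xxxxxx) into one 16-bit
   code unit. Returns 0 if the bytes do not have that shape.
   Hypothetical helper for illustration only. */
uint16_t decode_utf8_two_byte(unsigned char lead, unsigned char trail)
{
    if ((lead & 0xE0) != 0xC0 || (trail & 0xC0) != 0x80) {
        return 0;
    }
    /* 5 payload bits from the lead byte, 6 from the trailing byte. */
    return (uint16_t)(((lead & 0x1F) << 6) | (trail & 0x3F));
}
```

Calling decode_utf8_two_byte(0xC3, 0xB8) yields 0x00F8, the code point of ø, which is the value a unichar would hold.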


From String Format Specifiers in String Programming Guide:

%s : Null-terminated array of 8-bit unsigned characters. %s interprets its input in the system encoding rather than, for example, UTF-8.

So NSLog(@"%s", theChar) creates and displays an NSString object using the wrong encoding, while theChar itself contains the correct string data.

NSLog(@"%@", [NSString stringWithUTF8String:theChar]);

This gives the correct output (a ø b).
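The garbled '√∏' is consistent with the two UTF-8 bytes of ø (0xC3 0xB8) being read as Mac OS Roman, where 0xC3 is √ and 0xB8 is ∏. The sketch below, in plain C, reproduces that misreading. The function misread_as_mac_roman is hypothetical, and it maps only ASCII and those two bytes rather than the full Mac OS Roman table.

```c
#include <stddef.h>
#include <string.h>

/* Re-read bytes as Mac OS Roman and write the result as UTF-8.
   Only ASCII and the bytes 0xC3 and 0xB8 are mapped (illustration only).
   Returns 0 on success, -1 on an unmapped byte or a too-small buffer. */
int misread_as_mac_roman(const char *in, char *out, size_t out_size)
{
    size_t used = 0;
    if (out_size == 0) {
        return -1;
    }
    for (const unsigned char *p = (const unsigned char *)in; *p != '\0'; p++) {
        char ascii[2] = { 0, 0 };
        const char *piece;
        if (*p < 0x80) {
            ascii[0] = (char)*p;
            piece = ascii;
        } else if (*p == 0xC3) {
            piece = "\xE2\x88\x9A";   /* U+221A SQUARE ROOT */
        } else if (*p == 0xB8) {
            piece = "\xE2\x88\x8F";   /* U+220F N-ARY PRODUCT */
        } else {
            return -1;
        }
        size_t n = strlen(piece);
        if (used + n + 1 > out_size) {
            return -1;                /* keep room for the terminating NUL */
        }
        memcpy(out + used, piece, n);
        used += n;
    }
    out[used] = '\0';
    return 0;
}
```

Feeding it the bytes of "a ø b" produces the UTF-8 text for "a √∏ b", matching the third log line in the question.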


I'd like to add that your theChar does contain the UTF-8 byte sequence of your desired string. The problem lies with NSLog(@"%s"), which cannot display that string correctly in the log file or the console.

So, if you want to pass the UTF-8 byte sequence as a char * to some other library, what you did is perfectly correct.
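One way to convince yourself that the buffer is fine is to print its bytes in hex instead of relying on %s. This is a minimal sketch in plain C; bytes_to_hex is a hypothetical helper, not a Foundation function.

```c
#include <stddef.h>
#include <stdio.h>

/* Write the bytes of a NUL-terminated string as space-separated hex.
   Returns 0 on success, -1 if the output buffer is too small.
   Hypothetical helper for illustration only. */
int bytes_to_hex(const char *s, char *out, size_t out_size)
{
    size_t used = 0;
    if (out_size == 0) {
        return -1;
    }
    out[0] = '\0';
    for (const unsigned char *p = (const unsigned char *)s; *p != '\0'; p++) {
        /* Worst case per byte: separator + two hex digits + NUL. */
        if (used + 4 > out_size) {
            return -1;
        }
        if (used > 0) {
            out[used++] = ' ';
        }
        snprintf(out + used, 3, "%02X", (unsigned)*p);
        used += 2;
    }
    return 0;
}
```

For the theChar from the question, the result should be 61 20 C3 B8 20 62, that is 'a', space, the two bytes of ø, space, 'b'.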
