开发者

Strange behavior of String's matches() method

I encountered an interesting issue about String's matches(RegExp) method.

assertTrue("33CCFF".matches("[0-9A-Za-z]{6}"));
assertTrue("CC33FF".matches("[0-9A-Za-z]{6}"));
assertTrue("CC3355".matches("[0-9A-Za-z]{6}"));
assertTrue("CC9955".matches("[0-9A-Za-z]{6}"));
assertTrue("CC3366".matches("[0-9A-Za-z]{6}"));
assertTrue("CC3965".matches("[0-9A-Za-z]{6}"));
assertTrue("CC1961".matches("[0-9A-Za-z]{6}"));
assertTrue("CC9999".matches("[0-9A-Za-z]{6}"));
assertTrue("СС3966".matches("[0-9A-Za-z]{6}")); // failing
assertTrue("СС9965".matches("[0-9A-Za-z]{6}")); // failing
as开发者_如何学GosertTrue("СС9966".matches("[0-9A-Za-z]{6}")); // failing

The last 3 assertion is failing unexpectedly. I couldn't find any reasons why this weird behavior is happening. Do you have the same issue? Do you have any ideas?

By the way, in case of being asked, my java version is the following.

java version "1.6.0_26"
Java(TM) SE Runtime Environment (build 1.6.0_26-b03)
Java HotSpot(TM) 64-Bit Server VM (build 20.1-b02, mixed mode)


The last three don't actually start with an ASCII "C" character. They start with a non-ASCII character which looks like "C". That doesn't match anything in the [0-9A-Za-z] set, hence the pattern fails.

(I found this out by copying and pasting the code into a text editor which doesn't handle non-ASCII characters terribly well - they came out as "?".)


your "СС3966" (I'm cutting and pasting) are getting flagged as non UTF-8, which is why reg-ex isn't matching them. When I change your text and just type it myself it works as expected. Not sure where you copied these values from, but that's your problem

0

上一篇:

下一篇:

精彩评论

暂无评论...
验证码 换一张
取 消

最新问答

问答排行榜