Java Regex that only replaces multiple whitepaces with Non-Breaking Spaces
I am looking for a Java regex way of replacing multiple spaces with non-breaking spaces. Two or more white开发者_运维百科spaces should be replaced with the same number of non-breaking spaces, but single whitespaces should NOT be replaced. This needs to work for any number of whitespaces. And the first characters could be 1 or more whitespaces.
So if my String starts off like this:
TESTING THIS OUT WITH DIFFERENT CASES
I need the new String to look like this:
TESTING THIS OUT WITH DIFFERENT CASES
Let's use some regex (black ?) magic.
String testStr = "TESTING THIS OUT WITH DIFFERENT CASES";
Pattern p = Pattern.compile(" (?= )|(?<= ) ");
Matcher m = p.matcher(testStr);
String res = m.replaceAll(" ");
The pattern looks for either a whitespace followed by another one, or a whitespace following another one. This way it catches all blanks in a sequence. On my machine, with java 1.6, I get the expected result:
TESTING THIS OUT WITH DIFFERENT CASES
You could also skip the regex all together.
String testStr = "TESTING THIS OUT WITH DIFFERENT CASES";
String _replaced = testStr.replace(" ", " ");
String replaced = _replaced.replace(" ", " ");
I haven't tested this but the first one finds all cases of two spaces and replaces them with non-breaking spaces. The second finds cases where there were an odd number of white-spaces and corrects it with two nbsps.
Edit: This does not handle punctuation, and reworking it to handle punctuation would require it to use the same approach as Sergio's answer but with two steps instead of one. Therefore, this is an inadequate answer and has been withdrawn.
Original answer below:
The most straightforward way that I can think of is a two-step method.
First, replace all spaces with " ". This is pretty fast because it doesn't have to be a regex.
String testStr = "TESTING THIS OUT WITH DIFFERENT CASES";
String replaced = testStr.replace(" ", " ");
Next, replace any single instances of " " with spaces.
String replaced2 = replaced.replaceAll("\\b \\b", " ");
System.out.println(replaced2);
Result:
TESTING THIS OUT WITH DIFFERENT CASES
精彩评论