
How to implement a collection (list, map?) of complicated strings in Java?

Problem -- I have something like the following entries, 1000 of them:

args1=msg args2=flow args3=content args4=depth args6=within ==> args5=content
args1=msg args2=flow args3=content args4=depth args6=within args7=distance ==> args5=content
args1=msg args2=flow args3=content args6=within ==> args5=content
args1=msg args2=flow args3=content args6=within args7=distance ==> args5=content
args1=msg args2=flow args3=flow ==> args4=flowbits
args1=msg args2=flow args3=flow args5=content ==> args4=flowbits
args1=msg args2=flow args3=flow args6=depth ==> args4=flowbits
args1=msg args2=flow args3=flow args6=depth ==> args5=content
args1=msg args2=flow args4=depth ==> args3=content
args1=msg args2=flow args4=depth args5=content ==> args3=content
args1=msg args2=flow args4=depth args5=content args6=within ==> args3=content
args1=msg args2=flow args4=depth args5=content args6=within args7=distance ==> args3=content

I'm doing some sort of suggestion method. Say, args1=msg args2=flow args3=flow ==> args4=flowbits

If the sentence contains msg, flow, and another flow, then I should return the suggestion of flowbits.

How should I go about doing this? I know I should scan a list or array for a match (whenever a character is typed in the textarea) and return the result, but with 1000 entries, how should I implement it?

I'm thinking of HashMap, but can I do something like this?

<"msg,flow,flow","flowbits">

Also, in a sentence the arguments might not be in order, so assuming that it's flow,flow,msg then I can't match anything in the HashMap as the key is "msg,flow,flow".

What should I do in this case? Please help. Thanks a million!


A Map's key can be another Map or a Set. Looks like all you need is something like a Map<Set<String>, String> or perhaps a Map<Map<String, String>, Map.Entry<String, String>> - not sure where these "args1","args2" are relevant.
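
A minimal sketch of the Map<Set<String>, String> idea, using two rules taken from the question's list. The class and method names (SetKeyLookup, setOf, suggest) are made up for illustration. One caveat: a Set collapses duplicates, so "msg, flow, flow" and "msg, flow" would produce the same key; rules that depend on a repeated word need a different key type.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

class SetKeyLookup {
    // Hypothetical rule table: the set of words in the sentence maps to one suggestion.
    static final Map<Set<String>, String> RULES = new HashMap<Set<String>, String>();

    static {
        // args1=msg args2=flow args4=depth ==> args3=content
        RULES.put(setOf("msg", "flow", "depth"), "content");
        // args1=msg args2=flow args3=content args6=within ==> args5=content
        RULES.put(setOf("msg", "flow", "content", "within"), "content");
    }

    static Set<String> setOf(String... words) {
        return new HashSet<String>(Arrays.asList(words));
    }

    // Returns the suggestion for the given words, or null when no rule matches.
    // HashSet equality ignores order, so the order of the words does not matter.
    static String suggest(String... words) {
        return RULES.get(setOf(words));
    }

    public static void main(String[] args) {
        System.out.println(suggest("flow", "depth", "msg"));
        System.out.println(suggest("msg", "flow"));
    }
}
```

Because the lookup is a single hash probe, the number of rules (1000 or more) has little effect on the cost of each keystroke.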


In other words, try not to do all your logic in your parser. Split up the logic so the parser is simply detecting the structure and then build objects to help you enforce the rules. A parser can easily detect the arguments and use them to create a list.

If you create a class to contain your arguments like so:

public class Arg {
    public int number;
    public String value;

    public Arg(int num, String val) {
        this.number = num;
        this.value = val;
    }

    @Override
    public String toString() {
        return "[Arg num=" + number + ", value=" + value + "]";
    }

}

then you can put those in a simple List (a Map needs both a key type and a value type, so Map<Arg> would not compile).

List<Arg> argList = new ArrayList<Arg>();

Then you can do the logic using maybe a counter and contains() or indexOf() etc.

Having the Arg class makes sorting easy too. If you need the list sorted by the argument position, you create a Comparator for that.

import java.util.Comparator;

public class ArgNumComparator implements Comparator<Arg> {
    // Orders args by their position number, ascending.
    public int compare(Arg o1, Arg o2) {
        if (o1.number == o2.number) {
            return 0;
        }
        return o1.number < o2.number ? -1 : 1;
    }
}

Sorting by the argument value is even easier since you can reuse the compareTo() of String.

import java.util.Comparator;

public class ArgValComparator implements Comparator<Arg> {
    // Orders args alphabetically by value.
    public int compare(Arg o1, Arg o2) {
        return o1.value.compareTo(o2.value);
    }
}

Then, to do the sorting use the Collections.sort() like so:

import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

public class ArgList{
    public static void main(String[] args)  {
        //args1=msg args2=flow args3=content args4=depth args6=within ==> args5=content
        List<Arg> l = new ArrayList<Arg>();  
        // hard-coded example instead of more likely parsing
        l.add(new Arg(1, "msg"));
        l.add(new Arg(2, "flow"));
        l.add(new Arg(3, "content"));
        l.add(new Arg(4, "depth"));
        l.add(new Arg(5, "flow"));
        l.add(new Arg(6, "within"));

        Collections.sort(l, new ArgValComparator()); // take your pick of comparators

        System.out.println(l); // uses the toString() of Arg.
    }
}

EDIT: added a toString() method to Arg and changed the list in the example to have two "flow" args.

Running with the new toString code puts the following to the console:

[[Arg num=3, value=content], [Arg num=4, value=depth], [Arg num=2, value=flow], [Arg num=5, value=flow], [Arg num=1, value=msg], [Arg num=6, value=within]]

As you can see, the two args with value="flow" are now next to each other. Detecting multiple args where value="flow" can be done like this:

boolean flowFound = false;
for (Arg arg : l) {
    if (arg.value.equalsIgnoreCase("flow")) {
        if (flowFound) {  // already found one?
            // action when second "flow" exists
            System.out.println("2nd flow found");
        } else {
            flowFound = true;  // found the first "flow"
        }
    }
}


Yes, you can store <"msg,flow,flow","flowbits"> as an entry in a HashMap. Whether that is the best solution, I don't know.


At first glance, this looks like a good use for a parser and formal grammar rather than a collection. ANTLR is a popular parser generator for Java.

Where a parser won't solve your problem is if the arguments can appear in any order. In that situation I would use some sort of Case object that combines rules and actions, and use a simple Map<String,List<Case>> to find the instances that might apply to a given text (you'd extract individual words from the text to probe the map, and could combine the lists returned from each probe).

I don't have the time to give a complete example, but the Case object would probably look something like this:

public interface Case {
    boolean evaluate(String text);
    String result();
}
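
To make that a bit more concrete, here is one way the Case interface and the Map<String,List<Case>> index might fit together. This is only a sketch under my own assumptions: WordsCase, CaseIndex, add and suggest are hypothetical names, the text is split on whitespace, and a case is treated as matching when the text contains at least the required words (counting repeats), in any order.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Map;
import java.util.Set;

class CaseIndex {
    interface Case {
        boolean evaluate(String text);
        String result();
    }

    // Hypothetical Case: matches when every required word occurs in the text,
    // as many times as it is required, regardless of order.
    static class WordsCase implements Case {
        private final List<String> required;
        private final String result;

        WordsCase(String result, String... required) {
            this.result = result;
            this.required = Arrays.asList(required);
        }

        public boolean evaluate(String text) {
            List<String> remaining =
                new ArrayList<String>(Arrays.asList(text.trim().split("\\s+")));
            for (String word : required) {
                if (!remaining.remove(word)) { // false when the word is not (or no longer) present
                    return false;
                }
            }
            return true;
        }

        public String result() {
            return result;
        }
    }

    // Each word maps to the cases that mention it.
    private final Map<String, List<Case>> index = new HashMap<String, List<Case>>();

    void add(WordsCase c) {
        for (String word : new LinkedHashSet<String>(c.required)) {
            List<Case> bucket = index.get(word);
            if (bucket == null) {
                bucket = new ArrayList<Case>();
                index.put(word, bucket);
            }
            bucket.add(c);
        }
    }

    // Probes the map with each word, then evaluates only the candidate cases.
    List<String> suggest(String text) {
        Set<Case> candidates = new LinkedHashSet<Case>();
        for (String word : text.trim().split("\\s+")) {
            List<Case> bucket = index.get(word);
            if (bucket != null) {
                candidates.addAll(bucket);
            }
        }
        List<String> results = new ArrayList<String>();
        for (Case c : candidates) {
            if (c.evaluate(text) && !results.contains(c.result())) {
                results.add(c.result());
            }
        }
        return results;
    }

    public static void main(String[] args) {
        CaseIndex idx = new CaseIndex();
        idx.add(new WordsCase("flowbits", "msg", "flow", "flow"));
        idx.add(new WordsCase("content", "msg", "flow", "depth"));
        System.out.println(idx.suggest("flow msg flow"));
    }
}
```

The index keeps the per-keystroke work small: only cases that share at least one word with the text are evaluated, rather than all 1000.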


You could create an object and encapsulate the logic within it for each line. If the input is exactly as described, you should be able to extract the args with a simple regular expression and capturing groups, then call setters on your object for each arg. Then your data structure would just be a list of these objects.
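
As a rough illustration of the regular-expression approach, the sketch below parses one line of the form shown in the question. RuleParser and Rule are hypothetical names, and instead of one setter per arg it simply collects the left-hand values in a list; it assumes each line has exactly one "==>" and that values contain no whitespace.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class RuleParser {
    // One "argsN=value" token: group 1 is the number, group 2 is the value.
    private static final Pattern ARG = Pattern.compile("args(\\d+)=(\\S+)");

    // Hypothetical holder for one parsed line.
    static class Rule {
        final List<String> conditions = new ArrayList<String>();
        String suggestion;
    }

    static Rule parse(String line) {
        String[] sides = line.split("==>");
        if (sides.length != 2) {
            throw new IllegalArgumentException("Expected exactly one '==>' in: " + line);
        }
        Rule rule = new Rule();

        // Left of the arrow: every arg value is a condition.
        Matcher left = ARG.matcher(sides[0]);
        while (left.find()) {
            rule.conditions.add(left.group(2));
        }

        // Right of the arrow: the single suggested arg.
        Matcher right = ARG.matcher(sides[1]);
        if (!right.find()) {
            throw new IllegalArgumentException("No suggestion found in: " + line);
        }
        rule.suggestion = right.group(2);
        return rule;
    }

    public static void main(String[] args) {
        Rule r = parse("args1=msg args2=flow args3=flow ==> args4=flowbits");
        System.out.println(r.conditions + " -> " + r.suggestion);
    }
}
```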


Since the order that the strings appear is not important, you can alphabetize them when you create the key. Suppose you want the same suggestion for, msg,flow,flow and flow,msg,flow and flow,flow,msg -- alphabetized, they are "flow,flow,msg", so that is what you use as the key.
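
A small sketch of that idea, assuming a plain HashMap<String, String> with comma-joined keys; SortedKeyLookup, key, addRule and suggest are names invented for this example. Unlike a Set-based key, the sorted string keeps duplicates, so "flow,flow,msg" and "flow,msg" stay distinct.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.Map;

class SortedKeyLookup {
    private final Map<String, String> rules = new HashMap<String, String>();

    // Builds an order-independent key: sort a copy of the words, then join with commas.
    static String key(String... words) {
        String[] sorted = words.clone();
        Arrays.sort(sorted);
        return String.join(",", sorted);
    }

    void addRule(String suggestion, String... words) {
        rules.put(key(words), suggestion);
    }

    // Returns the suggestion, or null when no rule has exactly these words.
    String suggest(String... words) {
        return rules.get(key(words));
    }

    public static void main(String[] args) {
        SortedKeyLookup lookup = new SortedKeyLookup();
        lookup.addRule("flowbits", "msg", "flow", "flow");
        System.out.println(key("flow", "msg", "flow"));
        System.out.println(lookup.suggest("flow", "flow", "msg"));
    }
}
```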
