how to parse this text in c#

2023-01-10 20:42 问答作者：

abc  = tamaz feeo maa roo key gaera porla
Xyz = gippaza eka jaguar ammaz te sanna.

i want to make a struct

public struct word
{
 public string Word;
 public string开发者_Python百科 Definition;
}

how i can parse them and make a list of <word> in c#.

how i can parse it in c#

thanks for help but it is a text and it is not sure that a line or more so what i do for newline

Read the input line by line and split by the equal sign.

class Entry
{
    private string term;
    private string definition;

    Entry(string term, string definition)
    {
        this.term = term;
        this.definition = definition;
    }
}

// ...

string[] data = line.Split('=');
string word = data[0].Trim();
string definition = data[1].Trim();

Entry entry = new Entry(word, definition);

This can also be done using a very simple LINQ query:

var definitions =
    from line in File.ReadAllLines(file)
    let parts = line.Split('=')
    select new word
        {
            Word = parts[0].Trim(),
            Definition = parts[1].Trim()
        }

Using RegExp you can proceed in two ways, depending on your source input

Exemple 1

Assuming you have read your source and saved any single line in a vector or list :

string[] input = { "abc  = tamaz feeo maa roo key gaera porla", "Xyz = gippaza eka jaguar ammaz te sanna." };

 Regex mySplit = new Regex("(\\w+)\\s*=\\s*((\\w+).*)");

 List<word> mylist = new List<word>();

 foreach (string wordDef in input)
 {
      Match myMatch = mySplit.Match(wordDef);

      word myWord;

      myWord.Word = myMatch.Groups[1].Captures[0].Value;
      myWord.Definition = myMatch.Groups[2].Captures[0].Value;

       mylist.Add(myWord);
 }

Exemple 2

Assuming you have read your source in a single variable (and any line is terminated with the line break character '\n') you can use the same regexp "(\w+)\s*=\s*((\w+).*)" but in this way

string inputs = "abc  = tamaz feeo maa roo, key gaera porla\r\nXyz = gippaza eka jaguar; ammaz: te sanna.";

MatchCollection myMatches = mySplit.Matches(inputs);

foreach (Match singleMatch in myMatches)
{

    word myWord;

    myWord.Word = singleMatch.Groups[1].Captures[0].Value;
    myWord.Definition = singleMatch.Groups[2].Captures[0].Value;

    mylist.Add(myWord);
}

Lines that matches or does not match the regexp "(\w+)\s=\s*((\w+).)":

"abc = tamaz feeo maa roo key gaera porla,qsdsdsqdqsd\n" --> Match!
"Xyz= gippaza eka jaguar ammaz te sanna. sdq=sqds \n" --> Match! you can insert description that includes spaces too.
"qsdqsd=\nsdsdsd\n" --> Match a multiline pair too!
"sdqsd=\n" --> DO NOT Match! (lacking descr)
"= sdq sqdqsd.\n" --> DO NOT Match! (lacking word)

// Split at an = sign. Take at most two parts (word and definition); 
//    ignore any = signs in the definition
string[] parts = line.Split(new[] { '=' }, 2);

word w = new word();
w.Word = parts[0].Trim();

// If the definition is missing then parts.Length == 1
if (parts.Length == 1)
    w.Definition = string.Empty;
else
    w.Definition = parts[1].Trim();

words.Add(w);

Use Regular Expressions

继续阅读：parsing

how to parse this text in c#

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？