开发者

Best way to parse a log file in C#

开发者 https://www.devze.com 2023-01-23 11:07 出处:网络
I have the following log file: START:SOME_STRING BL:2 LK:3 LH:5 end START:SOME_STRING BL:5 LK:6 LH:6 end Which has multiple START: -> end structures inside. Is there a better \'non-slop开发者_开发知

I have the following log file:

START:SOME_STRING
BL:2
LK:3
LH:5
end
START:SOME_STRING
BL:5
LK:6
LH:6
end

Which has multiple START: -> end structures inside. Is there a better 'non-slop开发者_开发知识库py' way of parsing this file rather than reading line by line and using SPLIT?


You can try to formalize your ini-file's grammar, and you some of parser generators. See this question for more detail.

Be aware howeveer that for such a simple grammar as yours it might be easier to parse manually :-P

class IniEntry
{
    public int BL;
    public int LK;
    public int LH;
    IniEntry Clone() { return new IniEntry { BL = BL, LK = LK, LH = LH }; }
}

IEnumerable<IniEntry> Parse()
{
    IniEntry ie = new IniEntry();
    while (ParseEntry(out ie))
        yield return ie.Clone();
}

bool ParseEntry(out IniEntry ie)
{
    ie = new IniEntry();
    return ParseStart(ie) &&
               ParseBL(ie) &&
               ParseLK(ie) &&
               ParseLH(ie) &&
               ParseEnd(ie);
}

bool ParseStart(IniEntry ie)
{
    string dummy;
    return ParseLine("START", out dummy);
}

bool ParseBL(IniEntry ie)
{
    string BL;
    return ParseLine("BL", out BL) && int.TryParse(BL, out ie.BL);
}

bool ParseLK(IniEntry ie)
{
    string LK;
    return ParseLine("LK", out LK) && int.TryParse(LK, out ie.LK);
}

bool ParseLH(IniEntry ie)
{
    string LH;
    return ParseLine("LH", out LH) && string.TryParse(LH, out ie.LH);
}

bool ParseLine(string key, out string value)
{
    string line = GetNextLine();
    var parts = line.Split(":");
    if (parts.Count != 2) return false;
    if (parts[0] != key) return false;
    value = parts[1];
}

etc.


This is a good candidate for a while loop and a state machine. With this approach you would use even use less memory and have greater performance than using string.split()


If it is certain that the START/END are always matched, (apologies, my C# is embarrassing, so plain English):

Read the whole file with System.IO.ReadToEnd
Parse the whole thing in one go with a regular expression
Iterate over regex results

The regex would be something like "(START:([^$]+)$BL:([^$]+)$LK:([^$]+)$LH:([^$]+)$end$)+", off the top of my head, you'll need to validate/adjust according to how your parameters BL/LK etc. occur

0

精彩评论

暂无评论...
验证码 换一张
取 消