开发者

How do I loop through results from std::regex_search?

开发者 https://www.devze.com 2023-04-02 15:14 出处:网络
After calling std::regex_search, I\'m only able to get the first string result from the std::smatch for some reason:

After calling std::regex_search, I'm only able to get the first string result from the std::smatch for some reason:

Expression.assign("rel=\"nofollow\">(.*?)</a>");
if (开发者_C百科std::regex_search(Tables, Match, Expression))
{
    for (std::size_t i = 1; i < Match.size(); ++i)
        std::cout << Match[i].str() << std::endl;
}

So I tried to do it another way - with an iterator:

const std::sregex_token_iterator End;
Expression.assign("rel=\"nofollow\">(.*?)</a>");
for (std::sregex_token_iterator i(Tables.begin(), Tables.end(), Expression); i != End; ++i)
{
    std::cout << *i << std::endl;
}

This does go through every match, but it also gives me the whole matching string instead of just the capture that I was after. Surely must be another way than having to do another std::regex_search on the iterator element in the loop?

Thanks in advance.


regex_token_iterator takes an optional fourth argument specifying which submatch is returned for each iteration. The default value of this argument is 0, which in case of the C++ (and many other) regexes means "the whole match". If you want to get the first captured submatch, simply pass 1 to the constructor:

const std::sregex_token_iterator End;
Expression.assign("rel=\"nofollow\">(.*?)</a>");
for (std::sregex_token_iterator i(Tables.begin(), Tables.end(), Expression, 1); i != End; ++i)
{
    std::cout << *i << std::endl; // *i only yields the captured part
}


std::regex_search searches for the regex just once. It does not return a list of matches, but a list of submatched expressions (those within parentheses). This is why you only get one Match[1], the text inside the link tag.

As for the second code, it actually returns you all the matches, but it returns you again match_results object, so you have to use the [] operator:

const std::sregex_iterator End;
Expression.assign("rel=\"nofollow\">(.*?)</a>");
for (std::sregex_iterator i(Tables.begin(), Tables.end(), Expression); i != End; ++i)
{
    std::cout << (*i)[1] << std::endl; // first submatch, same as above.
}
0

精彩评论

暂无评论...
验证码 换一张
取 消

关注公众号