开发者

How to define grammar which excludes a certain set of words?

开发者 https://www.devze.com 2023-02-23 02:26 出处:网络
I have built a small code for static analysis of C code. The purpose of building it is to warn users about the use of methods such as strcpy() which could essentially cause buffer overflows.

I have built a small code for static analysis of C code. The purpose of building it is to warn users about the use of methods such as strcpy() which could essentially cause buffer overflows.

Now, to formalise the same, I need to write a formal Grammar which shows the excluded libraries as NOT a part of the allowed set of accepted library methods used.

For example,

AllowedSentence->ANSI C Permitted Code, NOT UnSafeLibraryMethods

UnSafeLibraryMethods->strcpy|o开发者_JAVA百科ther potentially unsafe methods

Any ideas on how this grammar can be formalised?


I think, this should not be done at the grammar level. It should be a rule that is applied to the parse tree after parsing is done.


You hardly need a parser for the way you have posed the problem. If your only goal is to object to the presence of certain identifiers ("strcpy"), you can simply build a lexer that processes C and picks identifiers. Special lexemes can recognize your list of "you shouldn't use this". This way you use positive recognition instead of negative recognition to pick out the identifiers that you belive to be trouble.

If you want a more sophisticated analaysis tool, you'll likely want to parse C, an name-resolve the identifers to their actual definitisn, then the scan the tree looking for identifiers that are objectionable. This will at least let you decide if the identifier is actually defined by the user, or comes from some known library; surely, if my code defines strcpy, you shouldn't complain unless you know my strcpy is defective somehow.

0

精彩评论

暂无评论...
验证码 换一张
取 消

关注公众号