Debugging
When things go wrong you will want to debug your parser.
Parser debug mode
During grammar design you can make both syntax and semantic errors. Arpeggio will report any syntax error with all the necessary information, whether you are building the parser from Python expressions or from a textual PEG notation.
For semantic errors there is a debugging mode of operation, which is entered by setting the debug parameter to True in the parser construction call.
parser = ParserPython(calc, debug=True)
When Arpeggio runs in debug mode it prints detailed information about what it is doing.
>> Entering rule calc=Sequence at position 0 => *-(4-1)*5+(
>> Entering rule OneOrMore in calc at position 0 => *-(4-1)*5+(
>> Entering rule expression=Sequence in calc at position 0 => *-(4-1)*5+(
>> Entering rule term=Sequence in expression at position 0 => *-(4-1)*5+(
>> Entering rule factor=Sequence in term at position 0 => *-(4-1)*5+(
>> Entering rule Optional in factor at position 0 => *-(4-1)*5+(
>> Entering rule OrderedChoice in factor at position 0 => *-(4-1)*5+(
>> Match rule StrMatch(+) in factor at position 0 => *-(4-1)*5+(
-- No match '+' at 0 => '*-*(4-1)*5+('
>> Match rule StrMatch(-) in factor at position 0 => *-(4-1)*5+(
++ Match '-' at 0 => '*-*(4-1)*5+('
<< Leaving rule OrderedChoice
<< Leaving rule Optional
>> Entering rule OrderedChoice in factor at position 1 => -*(4-1)*5+(2
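The trace grows quickly, so it can help to filter it. The following is a minimal sketch, not part of Arpeggio, that keeps only the lines reporting a successful or failed terminal match. It assumes the line prefixes seen in the output above ("++ Match" and "-- No match") and that the trace has already been captured as text, for example by redirecting standard output. The function name match_lines is hypothetical.

```python
def match_lines(trace):
    """Return only the terminal match results from a debug trace."""
    kept = []
    for line in trace.splitlines():
        stripped = line.strip()
        # Prefixes are taken from the sample output above; they are an
        # assumption about the trace format, not a documented contract.
        if stripped.startswith(("++ Match", "-- No match")):
            kept.append(stripped)
    return kept


# A few lines copied from the trace shown above.
sample = """\
>> Match rule StrMatch(+) in factor at position 0 => *-(4-1)*5+(
-- No match '+' at 0 => '*-*(4-1)*5+('
>> Match rule StrMatch(-) in factor at position 0 => *-(4-1)*5+(
++ Match '-' at 0 => '*-*(4-1)*5+('
<< Leaving rule OrderedChoice
"""

for line in match_lines(sample):
    print(line)
```

Run on the sample, this leaves the two result lines (the failed '+' match and the successful '-' match) and drops the rule entry and exit lines.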
Visualization
Furthermore, while running in debug mode, a dot file (a graph description file format from the GraphViz software package) representing the parser model will be created if the parser model is constructed without errors.
This dot file can be rendered as an image using one of the available dot viewers, or transformed to an image using the dot tool from the GraphViz software.
$ dot -Tpng -O calc_parser_model.dot
After this command you will get the calc_parser_model.dot.png file, which can be opened in any png image viewer. This is what it looks like:
Each node in this graph is a parsing expression. Nodes are labeled by the type name of the parsing expression. If a node represents a rule from the grammar, the label is of the form <rule_name>=<PEG type>, where rule_name is the name of the grammar rule. The edges connect child expressions. The labels on the edges represent the order in which the graph will be traversed during parsing.
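To make the labeling convention concrete, here is a small sketch that splits a node label into its rule name and expression type. The helper name split_label is hypothetical and not part of Arpeggio; the sample labels are taken from the debug output earlier in this section.

```python
def split_label(label):
    """Split a node label into (rule_name, expression_type).

    Labels of grammar rules look like 'calc=Sequence'. Nodes that are
    not named rules carry only the type name, e.g. 'OneOrMore'.
    """
    rule_name, sep, peg_type = label.partition("=")
    if not sep:
        # No '=' present: the node is an unnamed parsing expression.
        return None, label
    return rule_name, peg_type


print(split_label("calc=Sequence"))   # ('calc', 'Sequence')
print(split_label("OneOrMore"))       # (None, 'OneOrMore')
```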
Furthermore, if you parse some input while the parser is in debug mode, a parse tree dot file will also be generated.
parse_tree = parser.parse("-(4-1)*5+(2+4.67)+5.89/(.2+7)")
This dot file can also be converted to png with the command:
$ dot -Tpng -O calc_parse_tree.dot
This produces the png image given below.
You can also explicitly render your parser model or parse tree to a dot file even if the parser is not in debug mode.
For the parser model this is achieved with the following Python code:
from arpeggio.export import PMDOTExporter
PMDOTExporter().exportFile(parser.parser_model,
                           "my_parser_model.dot")
For the parse tree it is achieved with:
from arpeggio.export import PTDOTExporter
PTDOTExporter().exportFile(parse_tree,
                           "my_parse_tree.dot")
To get, for example, png images from the dot files, do as usual:
$ dot -Tpng -O *dot
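If you would rather drive the conversion from Python, for instance right after exporting, something along these lines might work. It is only a sketch: it assumes the dot executable from GraphViz is on your PATH, it reuses the same -Tpng and -O options as the command above, and the function names dot_command and render_all are hypothetical.

```python
import shutil
import subprocess
from pathlib import Path


def dot_command(dot_file):
    """Build the same command line as shown above for a single dot file."""
    return ["dot", "-Tpng", "-O", str(dot_file)]


def render_all(directory="."):
    """Render every *.dot file in directory to png; return the files handled."""
    if shutil.which("dot") is None:
        # GraphViz is not installed or not on PATH, so nothing is rendered.
        return []
    rendered = []
    for dot_file in sorted(Path(directory).glob("*.dot")):
        # check=True raises CalledProcessError if dot reports a failure.
        subprocess.run(dot_command(dot_file), check=True)
        rendered.append(dot_file)
    return rendered


if __name__ == "__main__":
    for path in render_all():
        print("rendered", path)
```

Because of the -O option, each output file is named after its input with .png appended, just as with the shell command.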
Note
All tree images in these docs are rendered using Arpeggio's visualization and the dot tool from the GraphViz software.