Hello,
My Python parser tools (expat) are having kittens when they try to parse undefined entities (&svAres; for example). I am unaware of a Python library that will just *parse* them without complaint.
This is solvable by providing DTDs for the .xml files. Are there any?
Those of you who parse the .xml files looking for various stats, what do you use?
Thanks.
Parsing the .xml files
I wrote my own in Java. The solution to the DTD problem for me was to use the original DTD to resolve the entities in the modules. I haven't released a full version yet, but I can look into releasing a version that would simply compile the XML into a DOM object if you would like.
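
For anyone on the Python side, here is a rough sketch of the same idea, not the Java tool described above. It declares the missing entities in an internal DTD subset before handing the text to ElementTree (which uses expat), so the parser can expand them. The helper name parse_with_entities, the <module>/<race> sample and the replacement text "Ares" are all made up for illustration; the real entity values would have to come from the original DTD. It also assumes the values contain no double quotes, "%" or "&".

```python
import xml.etree.ElementTree as ET


def parse_with_entities(xml_text, root_tag, entities):
    """Parse xml_text after declaring `entities` in an internal DTD subset.

    Hypothetical helper: `entities` maps entity names to replacement text.
    Values are assumed to be plain text without '"', '%' or '&'.
    """
    decls = "".join(
        '<!ENTITY {} "{}">\n'.format(name, value)
        for name, value in entities.items()
    )
    doctype = "<!DOCTYPE {} [\n{}]>\n".format(root_tag, decls)

    # An XML declaration, if present, must stay first in the document,
    # so the DOCTYPE is spliced in right after it.
    prolog = ""
    if xml_text.startswith("<?xml"):
        end = xml_text.index("?>") + 2
        prolog, xml_text = xml_text[:end], xml_text[end:]

    return ET.fromstring(prolog + doctype + xml_text)


# Made-up sample: the element names and the value "Ares" are placeholders.
sample = "<module><race>&svAres;</race></module>"
root = parse_with_entities(sample, "module", {"svAres": "Ares"})
print(root.find("race").text)
```

Without the injected DOCTYPE, the same sample fails with an "undefined entity" ParseError, which is the complaint from the first post. This only works for files that do not already carry their own DOCTYPE.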