by biomat » Sun Dec 28, 2008 12:25 am
I have extensively tried both the June 20, 2008 beta release installed on my server and the online version on your server, which seems to be more advanced. I'm sorry to report that I could not get either of them to work satisfactorily with any of my data (manuscripts). I have three major complaints about them.
1) The recognition/parsing algorithms do not work well (e.g., I get figures where they're not and no figures where they are, and the reference parsing algorithm doesn't even find year 2004 in "Cieszewski, C.J. 2004. ...", and so on).
2) The available function selections (e.g., radio buttons, list selections, and even text corrections in the parsing of the references) do not stick; they are not recorded.
3) The design of the manuscript data entry is wasteful and repetitive, without any learning/correcting options.
I wanted to use this system so badly that I wasted several days trying to figure out ways to make it work by editing the parsing, editing the manuscript, restructuring the manuscript, and other desperate actions. Mostly because of complaint 2) above, I believe that neither of the two Lemon8-XML releases is ready to be classified as an alpha version yet. I believe this because an alpha version should be good enough to at least execute existing links and commands, and a command such as "Save/Update" should at least save what was typed in. The way these systems are right now, they're simply not usable or testable. They are merely a kind of preview of some of the features of a future software development.
Based on a painful struggle with these "previews" I have the following suggestions for the Lemon8-XML program changes.
Ad. 1) The parsing program should look from the beginning of each reference for 4 digits and use them as the indicator of the year (the 4 digits), the authorship (to the left of the year), and the title (to the right of the year, at least until the first period). Then it should look from the end of the reference for digits, periods, and dashes; these will be the pages, issues, and volumes. To the left of the ending digits is usually the outlet. After that you can use whatever you've got already, but if you do the above you'll have most of my citations already parsed. Perhaps you could have a switch, or at least examples of the format you expect, which would help too.
Ad. 2) All the links, buttons, text entries, etc. should be recorded permanently when Save or Update is clicked, or they should not be there.
Ad. 3) The program should allow for adding manual override definitions, such as LaTeX commands, with an option to use them instead of the trial-and-error guesswork on the title, authors, etc. For example, it would be much more intelligent if all the metadata were entered right in the manuscript in a database format, such as \Title{...}; \Author{...}; etc., and then parsed into the form, rather than my having to re-enter all the metadata into the disposable, disappearing form every time I rerun the same manuscript while trying to edit it so that Lemon8 can finally get it almost right. The same goes for figures and tables. The same thing could be done with the sections, defined as \section{...}, rather than fighting the errors in recognition of a subsection font that is the same size as the regular text. Finally, the same applies to the references, which could be entered in an overriding BibTeX format, for example.
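Going back to Ad. 1), here is a rough Python sketch of the heuristic I have in mind. It is only an illustration under my own assumptions: the function name, the returned field names, and the sample reference are all made up, and none of this is taken from Lemon8-XML itself.

```python
import re

def parse_reference(ref):
    """Heuristic split of one citation string (hypothetical helper, not Lemon8-XML code)."""
    ref = ref.strip()
    # From the beginning: the first stand-alone run of 4 digits is taken as the year.
    m = re.search(r"\b(\d{4})\b", ref)
    if m is None:
        return None  # no year found, so leave the reference for manual entry
    # Everything to the left of the year is the authorship.
    authors = ref[:m.start()].strip(" ,(")
    rest = ref[m.end():].lstrip(" .,):")
    # The title runs from the year to the first period.
    title, _, tail = rest.partition(".")
    tail = tail.strip()
    # From the end: a trailing run of digits, periods, dashes, and similar
    # punctuation is assumed to hold the volume, issue, and pages.
    n = re.search(r"[\d\s.,:()\-\u2013]+$", tail)
    locator = n.group().strip(" .") if n else ""
    # Whatever sits to the left of those ending digits is taken as the outlet.
    outlet = tail[:n.start()].strip(" ,.") if n else tail.strip(" ,.")
    # A final "digits-dash-digits" group, if present, is taken as the page range.
    p = re.search(r"\d+\s*[-\u2013]\s*\d+$", locator)
    return {
        "authors": authors,
        "year": m.group(1),
        "title": title.strip(),
        "outlet": outlet,
        "locator": locator,
        "pages": p.group() if p else "",
    }

# Made-up reference, used only to exercise the sketch.
print(parse_reference("Doe, J.A. 2004. A made-up title. Journal of Examples 12(3): 45-67."))
```

This will obviously trip over titles that contain a period or author lists that contain 4 digits, which is why a switch or a documented expected format would still help.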
Some of the solutions suggested above could be made optional (e.g., a switch between "Mouse Potato" and "LaTeX Freak") ;)
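For the "LaTeX Freak" side of Ad. 3), reading the override tags could be as simple as the Python sketch below. The tag names \Title, \Author, and \section are the ones I suggested above; the function name, the regular expression, and the dictionary layout are my own assumptions, and the pattern deliberately ignores nested braces.

```python
import re

# Hypothetical override tags embedded in the manuscript text.
# The pattern does not handle nested braces such as \Title{A {B} C}.
TAG = re.compile(r"\\(Title|Author|section)\{([^{}]*)\}")

def extract_overrides(text):
    """Collect the tagged metadata in order of appearance (illustrative only)."""
    found = {"Title": [], "Author": [], "section": []}
    for name, value in TAG.findall(text):
        found[name].append(value.strip())
    return found

# Made-up manuscript fragment.
sample = r"\Title{A made-up title} \Author{J. Doe} \Author{A. Roe} \section{Introduction} body text"
print(extract_overrides(sample))
```

The values found this way could pre-fill the metadata form, so nothing has to be retyped when the same manuscript is rerun.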
Best regards and good luck,
BTW, does anyone know whether it is possible, and if so how, to convert a LaTeX, PDF, or PS file into XML format?