XML Publishing Roadmap

From PKP Wiki
Revision as of 06:50, 6 March 2009 by Admin (Talk | contribs)

The PKP, in conjunction with RIT OPL and UNB ETC, will be extending our publishing support to include XML workflows. The goal is to have our software ingest XML (initially NLM Journal Publishing Tag Set Version 2.3, with Docbook, TEI and Erudit to follow) and render/display HTML and PDF.

Bugzilla tracking entry for XML publishing

Goals

  • Support Science (NLM) and SSH (Erudit, Docbook, TEI) publishing formats
  • Produce both HTML and PDF renderings
  • Integrate, where appropriate, with: L8X (preview), OJS, and OMP

Roadmap

Phase 1

  • Bring the XML galleys plugin up to date with respect to the localization changes in OJS 2.x
  • Modify plugin to generate galleys from an uploaded "layout" file rather than "galley" file
  • Move XSL code from the plugin into the PKP WAL
  • Determine the work required to refactor the Molecular Vision XSLT 2.0 files to XSLT 1.0
  • Examine and refactor/merge XSL files for HTML generation from Monash ePress, Open Medicine, Molecular Vision
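
When sizing up the XSL 2.0 to 1.0 refactoring item above, one cheap first signal is the version each contributed stylesheet declares on its root element. The sketch below is only an illustration under assumptions: the class and method names are hypothetical, and Java's built-in DOM parser is used purely for convenience. A declared version of 2.0 says nothing about how many 2.0-only constructs a file actually uses, so this is a triage step and not an estimate of the refactoring effort.

```java
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Element;
import org.xml.sax.InputSource;

// Hypothetical helper: reports the XSLT version a stylesheet declares.
class StylesheetVersionCheck {
    static final String XSLT_NS = "http://www.w3.org/1999/XSL/Transform";

    // Returns the value of the "version" attribute on xsl:stylesheet / xsl:transform.
    static String declaredVersion(String stylesheetXml) {
        try {
            DocumentBuilderFactory dbf = DocumentBuilderFactory.newInstance();
            dbf.setNamespaceAware(true); // needed to inspect the xsl: namespace
            Element root = dbf.newDocumentBuilder()
                .parse(new InputSource(new StringReader(stylesheetXml)))
                .getDocumentElement();
            boolean isStylesheet = XSLT_NS.equals(root.getNamespaceURI())
                && ("stylesheet".equals(root.getLocalName())
                    || "transform".equals(root.getLocalName()));
            if (!isStylesheet) {
                throw new IllegalArgumentException("root element is not an XSLT stylesheet");
            }
            return root.getAttribute("version");
        } catch (IllegalArgumentException e) {
            throw e;
        } catch (Exception e) {
            // Parser configuration, I/O, or well-formedness problems.
            throw new IllegalStateException("could not read stylesheet", e);
        }
    }

    // True when the stylesheet declares a 2.x version and so is a refactoring candidate.
    static boolean declaresXslt2(String stylesheetXml) {
        return declaredVersion(stylesheetXml).startsWith("2.");
    }

    public static void main(String[] args) {
        String sample =
            "<xsl:stylesheet version='2.0' xmlns:xsl='" + XSLT_NS + "'/>";
        System.out.println("declared version: " + declaredVersion(sample));
        System.out.println("declares XSLT 2.x: " + declaresXslt2(sample));
    }
}
```

Files flagged this way would still need to be read by hand to see which 2.0 features they rely on.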

Phase 2

  • Examine and refactor XSL files for PDF generation from Molecular Vision
  • Improve the plugin/XSL classes to support XSLT 2.0 via Saxon

Phase 3

  • Extend the current xmlGalleys plugin (which already supports NLM) to support Docbook and TEI, based on XSL from Monash, UNB and UVic

Phase 4

  • Extend/parameterize XSLs for HTML and PDF to produce customizable/SSH layout in addition to STM layout
  • Update XSL rendering to meet layouts developed by RIT
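
One possible way to parameterize a stylesheet for STM versus SSH layouts is a top-level xsl:param that the caller sets at transformation time. The sketch below assumes a parameter named `layout` with values `stm` and `ssh`, and a toy stylesheet and article written for this example; none of these names come from the existing plugin or from the contributed XSL files. It uses the standard Java JAXP API with an XSLT 1.0 stylesheet.

```java
import java.io.StringReader;
import java.io.StringWriter;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerException;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.stream.StreamResult;
import javax.xml.transform.stream.StreamSource;

class LayoutParamSketch {
    // Hypothetical XSLT 1.0 stylesheet; "layout" defaults to 'stm'.
    static final String XSL =
        "<xsl:stylesheet version='1.0' xmlns:xsl='http://www.w3.org/1999/XSL/Transform'>"
      + "<xsl:output method='html' indent='no'/>"
      + "<xsl:param name='layout' select=\"'stm'\"/>"
      + "<xsl:template match='/article'>"
      + "<div class='article {$layout}-layout'>"
      + "<xsl:value-of select='front/article-meta/title-group/article-title'/>"
      + "</div>"
      + "</xsl:template>"
      + "</xsl:stylesheet>";

    // Made-up article fragment using NLM element names.
    static final String SAMPLE_NLM =
        "<article><front><article-meta><title-group>"
      + "<article-title>Sample title</article-title>"
      + "</title-group></article-meta></front><body/></article>";

    // Renders the article; a null layout leaves the stylesheet default in place.
    static String render(String xml, String layout) {
        try {
            Transformer t = TransformerFactory.newInstance()
                .newTransformer(new StreamSource(new StringReader(XSL)));
            if (layout != null) {
                t.setParameter("layout", layout); // overrides the xsl:param default
            }
            StringWriter out = new StringWriter();
            t.transform(new StreamSource(new StringReader(xml)), new StreamResult(out));
            return out.toString();
        } catch (TransformerException e) {
            throw new IllegalStateException("XSLT transformation failed", e);
        }
    }

    public static void main(String[] args) {
        System.out.println(render(SAMPLE_NLM, null));
        System.out.println(render(SAMPLE_NLM, "ssh"));
    }
}
```

The same stylesheet serves both layouts here; whether the real SSH and STM layouts differ little enough for one parameterized file, or need separate imported stylesheets, is an open design question.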

Documentation

  • Notes on Monash contribution

Figure: Current and proposed XML rendering workflows

Figure: Workflow for conversion to NLM DTD

Figure: Sample XSLT customization process

Feature List

Feature list broken down by level of difficulty

Each feature is listed under its procedural step, with notes on what is simple and what is difficult.

1. Various XML to NLM

  • Docbook->NLM transformation
      Simple: requires proper XSL files
        - have been provided by Monash
        - can't handle Docbook 5
      Difficult: to support Docbook 5 we would have to substantially rework (or recreate) the files provided by Monash

  • TEI->NLM transformation
      Simple: requires proper XSL files
        - will be provided by UVic (will presumably handle Martin's TEIJournal modification; be back-portable to TEI P5)

  • Erudit->NLM transformation
      Simple: requires proper XSL files
        - possibly provided by UNB?

2. NLM to HTML/PDF galleys

  • NLM->HTML transformation
      Simple: requires proper XSLT 1.0 files
        - Monash has provided XSLT 1.0 files
        - current plugin already uses XSLT 1.0 files based on NLM's tools
      Difficult: XSLT 2.0 transformation
        - UVic would prefer to work with XSLT 2.0
        - Molecular Vision has provided XSLT 2.0 files
        - XSLT 2.0 would require using Java to use a 2.0-compatible XSLT engine (i.e. Saxon)

  • XML->XSL:FO->PDF
      Simple: not possible
      Difficult: requires Java
        - sample XSLT 1.0 in current plugin
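
As a rough illustration of the NLM->HTML feature in step 2, the sketch below applies an XSLT 1.0 stylesheet to a tiny article and returns HTML. Everything in it is an assumption made for the example: the stylesheet is a toy and not one of the Monash or NLM-based files, the article is made up (only the element names follow the NLM tag set), and the class name is hypothetical. OJS and its plugins are written in PHP; Java's standard JAXP API is used here only because the list above already mentions Java as the route to an XSLT 2.0 engine.

```java
import java.io.StringReader;
import java.io.StringWriter;
import javax.xml.transform.Transformer;
import javax.xml.transform.TransformerException;
import javax.xml.transform.TransformerFactory;
import javax.xml.transform.stream.StreamResult;
import javax.xml.transform.stream.StreamSource;

class NlmToHtmlSketch {
    // Hypothetical toy stylesheet (XSLT 1.0): title becomes <h1>, body paragraphs become <p>.
    static final String XSL =
        "<xsl:stylesheet version='1.0' xmlns:xsl='http://www.w3.org/1999/XSL/Transform'>"
      + "<xsl:output method='html' indent='no'/>"
      + "<xsl:template match='/article'>"
      + "<html><body>"
      + "<h1><xsl:value-of select='front/article-meta/title-group/article-title'/></h1>"
      + "<xsl:apply-templates select='body/p'/>"
      + "</body></html>"
      + "</xsl:template>"
      + "<xsl:template match='p'><p><xsl:value-of select='.'/></p></xsl:template>"
      + "</xsl:stylesheet>";

    // Made-up article fragment using NLM element names.
    static final String SAMPLE_NLM =
        "<article><front><article-meta><title-group>"
      + "<article-title>Sample title</article-title>"
      + "</title-group></article-meta></front>"
      + "<body><p>First paragraph.</p><p>Second paragraph.</p></body></article>";

    // Applies the given stylesheet to the given XML and returns the serialized result.
    static String transform(String xml, String xsl) {
        try {
            Transformer t = TransformerFactory.newInstance()
                .newTransformer(new StreamSource(new StringReader(xsl)));
            StringWriter out = new StringWriter();
            t.transform(new StreamSource(new StringReader(xml)), new StreamResult(out));
            return out.toString();
        } catch (TransformerException e) {
            throw new IllegalStateException("XSLT transformation failed", e);
        }
    }

    public static void main(String[] args) {
        System.out.println(transform(SAMPLE_NLM, XSL));
    }
}
```

The default transformer bundled with the JDK handles XSLT 1.0 only, which matches the "Simple" notes for this feature; running the Molecular Vision 2.0 files would mean putting a 2.0-capable engine such as Saxon on the classpath instead.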