PDF to HTML plugin

OJS development discussion, enhancement requests, third-party patches and plug-ins.

Moderators: jmacgreg, btbell, michael, bdgregg, barbarah, asmecher

Forum rules
The Public Knowledge Project Support Forum is moving to http://forum.pkp.sfu.ca

This forum will be maintained permanently as an archived historical resource, but all new questions should be added to the new forum. Questions will no longer be monitored on this old forum after March 30, 2015.
Posts: 6
Joined: Tue Dec 03, 2013 5:07 pm

PDF to HTML plugin

Postby JakeEWB » Tue Dec 03, 2013 5:30 pm

Hi all.

I'm interested in creating a plugin that converts an uploaded PDF file to HTML format. Once it has been converted I want to asynchronously load the HTML doc by section (i.e abstract, introduction, method, etc).
I was wondering if anyone could point in me in some directions. I'm aware of the XML Galley Plugin which converts XML to an XHTML galley, would anyone reccomend this as the avenue to go down? Or perhaps exporting the PDF, converting it on the server, then uploading the HTML?


Posts: 10015
Joined: Wed Aug 10, 2005 12:56 pm

Re: PDF to HTML plugin

Postby asmecher » Wed Dec 04, 2013 10:06 am

Hi JakeEWB,

The XML galleys plugin is not a bad place to start, though it has a few noted sections in the code that are known to be broken. I would suggest starting by experimenting with your conversion toolchain directly before you get too far into the details of plugin coding, as you may find that automatic PDF to HTML conversion suffers from poor quality output, depending on the process you're using to create PDFs in the first place.

Alec Smecher
Public Knowledge Project Team

Posts: 6
Joined: Tue Dec 03, 2013 5:07 pm

Re: PDF to HTML plugin

Postby JakeEWB » Wed Dec 04, 2013 8:36 pm

Hi Alec!

I'll concentrate further on the PDF to HTML conversion for now.

Thanks for the help!

Return to “OJS Development”

Who is online

Users browsing this forum: No registered users and 2 guests