OJS OCS OMP OHS

You are viewing the PKP Support Forum | PKP Home Wiki



Is search in PDF documents possible?

Are you responsible for making OJS work -- installing, upgrading, migrating or troubleshooting? Do you think you've found a bug? Post in this forum.

Moderators: jmacgreg, btbell, michael, bdgregg, barbarah, asmecher

Forum rules
What to do if you have a technical problem with OJS:

1. Search the forum. You can do this from the Advanced Search Page or from our Google Custom Search, which will search the entire PKP site. If you are encountering an error, we especially recommend searching the forum for said error.

2. Check the FAQ to see if your question or error has already been resolved.

3. Post a question, but please, only after trying the above two solutions. If it's a workflow or usability question you should probably post to the OJS Editorial Support and Discussion subforum; if you have a development question, try the OJS Development subforum.

Is search in PDF documents possible?

Postby tello » Mon Jul 17, 2006 6:36 am

Hi everybody;


Nowadays ,after installing our OJS system successfully, we are trying to create our back issues. My question is that ; Our system has no capability to make search in PDF documents. Is there any way to make it possible.In forum I found somethings but I did not understand anything. It is very important for us because we will not produce HTML forms of our back issues.

Thanks in advance, tello
tello
 
Posts: 8
Joined: Sun Apr 23, 2006 2:16 pm
Location: Turkey,METU

Postby asmecher » Mon Jul 17, 2006 9:33 am

Hi tello,

OJS uses external tools to extract text from PDF files for searching. In OJS's config.inc.php file, you need to specify where OJS can find these tools. For example, the stock configuration file includes the lines:
Code: Select all
; PDF
; index[application/pdf] = "/usr/bin/pstotext %s"
; index[application/pdf] = "/usr/bin/pdftotext %s -"
These lines are commented out by default -- if your server has the tool /usr/bin/pdftotext installed, un-comment that line:
Code: Select all
; PDF
; index[application/pdf] = "/usr/bin/pstotext %s"
index[application/pdf] = "/usr/bin/pdftotext %s -"

Once you've changed the configuration file, you can rebuild the text index by running "php tools/rebuildSearchIndex.php" from the command line.

Regards,
Alec Smecher
Open Journal Systems Team
asmecher
 
Posts: 7746
Joined: Wed Aug 10, 2005 12:56 pm

Postby tello » Mon Jul 17, 2006 12:55 pm

Thanks Alec ;


I have tried your what you told but it did not work. I think as you sad

These lines are commented out by default -- if your server has the tool /usr/bin/pdftotext installed, un-comment that line:


maybe our server doesn't have this option installed how can I be sure whether it is installed or not in my server.

Also I told you that I tried but I think only uncommenting the lines is nt enough because in my server the path is also different. What can we do further?.

Thanks in advance,tello
tello
 
Posts: 8
Joined: Sun Apr 23, 2006 2:16 pm
Location: Turkey,METU

Postby asmecher » Mon Jul 17, 2006 1:10 pm

Hi tello,

You're probably best off asking a server administrator if and where these tools are installed. OJS will support any command-line tool that converts PDFs into text, such as pdftotext (part of the xpdf toolset) and ps2ascii (part of the ghostscript toolset). One or both of these are often installed on *NIX servers, but the location and details depend on the operating system. If you're using a Windows server, you'll probably have to install one of these manually. I'd suggest getting the xpdf toolset (see http://www.foolabs.com/xpdf/download.html).

Regards,
Alec Smecher
Open Journal Systems Team
asmecher
 
Posts: 7746
Joined: Wed Aug 10, 2005 12:56 pm


Return to OJS Technical Support

Who is online

Users browsing this forum: No registered users and 2 guests