OJS OCS OMP OHS

You are viewing the PKP Support Forum | PKP Home Wiki



Harvesting of ojs

Open Harvester Systems support questions and answers, bug reports, and development issues.

Moderators: jmacgreg, michael, John

Forum rules
Developer Resources:

Git: You can access our public Git Repository here. Comprehensive Git usage instructions are available on the wiki.

Bugzilla: You can access our Bugzilla report tracker here.

Search: You can use our Google Custom Search to search across our main website, the support forum, and Bugzilla.

Questions and discussion are welcome.

Harvesting of ojs

Postby bubbleBoy » Tue Jul 19, 2011 6:28 pm

I am confused as what is harvesting of ojs means?
bubbleBoy
 
Posts: 46
Joined: Fri Apr 09, 2010 7:34 pm

Re: Harvesting of ojs

Postby jmacgreg » Wed Jul 20, 2011 1:27 pm

Hi bubbleboy,

I'm assuming you are referring to Journal Setup 3.5. OJS content is available via OAI; all you have to do is provide an OAI metadata harvester with your journal's OAI base URL, and it will automatically harvest your journal's metadata -- that is, all the information about your published articles, including author information; title; abstract; date published; keywords; and so on. The articles themselves are *not* harvested, but OAI harvesters will typically include a link back to your journal for full-text access (subscription-protected, of course, if your journal uses subscription protection). For more information, see Kevin's article Getting Found, Staying Found, Increasing Impact.

Cheers,
James
jmacgreg
 
Posts: 4181
Joined: Tue Feb 14, 2006 10:50 am

Re: Harvesting of ojs

Postby TSammy » Fri Aug 30, 2013 1:43 pm

Hi,
I'm having my OAI link from the Setup 3.5, it's http://prunz.org/ojs/index.php?journal=bube&page=oai. However, when I try to validate with http://www.openarchives.org/data/regist ... vider.html, it gives me FAIL (*), honnestly, I don't even know what to change. Please advise, many thanks in advance.

(*) [FAIL] Server at base URL 'http://prunz.org/ojs/index.php?journal=bube&page=oai' failed to respond to Identify. The HTTP GET request with URL http://prunz.org/ojs/index.php?journal= ... b=Identify received response code '404'.


--------------------------------------------------------------------------------
TSammy
 
Posts: 16
Joined: Fri Aug 30, 2013 1:34 pm

Re: Harvesting of ojs

Postby asmecher » Fri Aug 30, 2013 2:04 pm

Hi TSammy,

You'll need to configure your server so that you can turn off disable_path_info in config.inc.php; the OAI validator (and indeed the OAI protocol) assume that no other URL parameters are available besides those required for OAI, and disable_path_info requires additional ones to be set.

Regards,
Alec Smecher
Public Knowledge Project Team
asmecher
 
Posts: 8316
Joined: Wed Aug 10, 2005 12:56 pm

Re: Harvesting of ojs

Postby TSammy » Fri Aug 30, 2013 7:51 pm

Thank you Alec.

I am using goddady server, it's Linux with apache and runs on MySQL. The reason I'm writing this I could not be able to locate the config.inc.php here, so I could not turn off the disable_path_info. Where is config.inc.php? I tried with phpadmin, DNS manager and some more. Please advice.

Best regards,

Sam
TSammy
 
Posts: 16
Joined: Fri Aug 30, 2013 1:34 pm

Re: Harvesting of ojs

Postby TSammy » Sat Aug 31, 2013 10:19 am

Hi Alec,

I was able to find config.inc.pmp, but when I turned off the disable_path_info, all my site pages went off with 404 error, except http://prunz.org/ojs/index.php?journal=bube&page=user. Please advise.

P.S.
I found the same, disable_path_info, in config.TEMPLATE.inc.php, but this was already off.

Many thanks in advance.
TSammy
 
Posts: 16
Joined: Fri Aug 30, 2013 1:34 pm

Re: Harvesting of ojs

Postby asmecher » Tue Sep 03, 2013 9:19 am

Hi Sam,

The disable_path_info option is available for certain servers that don't support the standard CGI "path info" variable, which permits OJS to use shorter, more attractive URLs. It sounds like your server is one of these. If you're able, adjust the server's configuration so that path info works properly; if not, you'll have to put up with uglier URLs and some areas of the system that may not interoperate fully (like OAI).

Regards,
Alec Smecher
Public Knowledge Project Team
asmecher
 
Posts: 8316
Joined: Wed Aug 10, 2005 12:56 pm

Re: Harvesting of ojs

Postby TSammy » Mon Sep 09, 2013 12:18 pm

Hi Alec,

Actually I found and turned off disable_path_info, it was -as you had mentioned- in config.inc file of the server, however it did not work, all I saw was 404.

Regarding your comments below, how can I adjust the server in respect of this? I mean what can I do else? And if, say I couldn't do this, what would be the "uglier" URL that I can apply? Please advise.

Many thanks,

Sam
TSammy
 
Posts: 16
Joined: Fri Aug 30, 2013 1:34 pm

Re: Harvesting of ojs

Postby asmecher » Mon Sep 09, 2013 2:43 pm

Hi Sam,

The configuration change you need to make will depend on your web server and SAPI. If you can identify both, I can probably find a link that will help.

Regards,
Alec Smecher
Public Knowledge Project Team
asmecher
 
Posts: 8316
Joined: Wed Aug 10, 2005 12:56 pm

Re: Harvesting of ojs

Postby TSammy » Wed Sep 11, 2013 9:34 am

Thank you Alec. I run my OJS on Godaddy servers, with Apache Linux and MySQL. I hope this answers your question.
Many thanks in advance.
TSammy
 
Posts: 16
Joined: Fri Aug 30, 2013 1:34 pm

Re: Harvesting of ojs

Postby asmecher » Wed Sep 11, 2013 9:49 am

Hi TSammy,

That unfortunately will limit how much control you have over server configuration. However, Googling around shows a few options -- though I can't test them myself. See e.g. http://phphelp.co/2013/07/16/path_info-not-working-on-godaddy-how-to-enable-path_info-in-godaddy/.

Regards,
Alec Smecher
Public Knowledge Project Team
asmecher
 
Posts: 8316
Joined: Wed Aug 10, 2005 12:56 pm

Re: Harvesting of ojs

Postby TSammy » Thu Sep 12, 2013 9:59 am

Thank you Alec. Here is what I did:

I followed the instructions on the link that you sent me. I created a php5.ini, put the text accordingly, and save into the Main Folder (hope this is the root folder). Then, I opened the sub folder 1.4, OJS, opened config.ini.php, and edit the On as "disable_path_info = Off". (You may check the folder structure below.) Now, when I try to register my baseURL (http://prunz.org/ojs/index.php?journal=bube&page=oai) with http://www.openarchives.org/data/regist ... vider.html, I don't see a 404 message anymore, which is great, but what I see is "Identify response does not have OAI-PMH as root element! Found node named 'html' instead.".

Is there anything else that I can apply to fix the issue?

Main Folder, No 1: html
Sub Folder No 1.1: [Apache Logs]
Sub Folder No 1.2: _db_backups
Sub Folder No 1.3: cgi
Sub Folder No 1.4: ojs

Kind Regards,
TSammy
 
Posts: 16
Joined: Fri Aug 30, 2013 1:34 pm

Re: Harvesting of ojs

Postby asmecher » Thu Sep 12, 2013 12:46 pm

Hi TSammy,

The goal of changing your PHP configuration is to enable OJS to operate properly with disable_path_info = Off. If you think the php.ini change is working, try turning off disable_path_info to check. If OJS still works, you'll notice that the URLs have changed considerably. Check again what your OAI base URL is and you'll see it'll have changed too. If this is all working, try the new OAI URL in the harvester.

Regards,
Alec Smecher
Public Knowledge Project Team
asmecher
 
Posts: 8316
Joined: Wed Aug 10, 2005 12:56 pm

Re: Harvesting of ojs

Postby TSammy » Thu Sep 19, 2013 9:39 am

Hello Alec,

I did two-step validation tests (I'm not sure is there is the third) for my OJS main page, except the last sub-test of the second step, everything went perfectly. many thanks!! How can I proceed about this error below? Please advise. Many thanks in advance. (I use Apache on Godaddy servers with MySQL.)

Checking ListIdentifiers response
Request: GEThttp://prunz.org/ojs/index.php/index ... fix=oai_dc
[PASS] responseDate has correct format: 2013-09-13T19:18:29Z
[FAIL] The response to the ListIdentifiers verb contained no identifier elements. No further tests will be made.

Kind Regards,
TSammy
 
Posts: 16
Joined: Fri Aug 30, 2013 1:34 pm

Re: Harvesting of ojs

Postby TSammy » Thu Sep 19, 2013 9:42 am

And very similar to my previous post, where it was about the OJS Main site's error, here is the error page with the Journal Site:

Checking ListIdentifiers response
Request: GEThttp://prunz.org/ojs/index.php/bube/ ... fix=oai_dc
[PASS] responseDate has correct format: 2013-09-13T19:02:25Z
[FAIL] The response to the ListIdentifiers verb contained no identifier elements. No further tests will be made.

Again the first step is fine, this error belongs to the last item of the 2nd step of the test.

Many thanks,
TSammy
 
Posts: 16
Joined: Fri Aug 30, 2013 1:34 pm

Next

Return to Open Harvester Systems Support and Development

Who is online

Users browsing this forum: Bing [Bot] and 4 guests