OJS OCS OMP OHS

You are viewing the PKP Support Forum | PKP Home Wiki



Harvesting first time out: less records than expected?

Open Harvester Systems support questions and answers, bug reports, and development issues.

Moderators: jmacgreg, michael, John

Forum rules
Developer Resources:

Git: You can access our public Git Repository here. Comprehensive Git usage instructions are available on the wiki.

Bugzilla: You can access our Bugzilla report tracker here.

Search: You can use our Google Custom Search to search across our main website, the support forum, and Bugzilla.

Questions and discussion are welcome.

Harvesting first time out: less records than expected?

Postby davidWalker » Wed Jan 06, 2010 4:39 pm

Sorry, new to PKP Harvester, and harvesting in general.

So I'm looking to harvest from this DSpace repository:

http://sjsu-dspace.calstate.edu/

It has 3,000+ records. But when I do a flush and an update (both from the web and command line) I only get 98 records. No dates or sets selected. Chose 'all sets' in the web interface. Still always 98.

And the ListRecords verb for the archive shows 98:

http://sjsu-dspace.calstate.edu/oai/req ... fix=oai_dc

Am I missing something obvious here?
davidWalker
 
Posts: 6
Joined: Wed Jan 06, 2010 2:24 pm

Re: Harvesting first time out: less records than expected?

Postby davidWalker » Wed Jan 06, 2010 5:36 pm

I wonder if this is maybe a Dspace issue.

Reading around a bit in the DSpace docs, I see now why this initial response only contains 100 records. But shouldn't it include a resumption token?

Will ask on the Dspace list as well.
davidWalker
 
Posts: 6
Joined: Wed Jan 06, 2010 2:24 pm

Re: Harvesting first time out: less records than expected?

Postby davidWalker » Thu Jan 07, 2010 11:30 am

This was, in fact, a bug in DSpace 1.5.2. We have it working now.

For posterity, George Kozak from Cornell had this to say on the dspace-tech listserv:

"Yes, I did fix my problem. Stuart Lewis from the Auckland Library told me that my problem was caused by the 'harvest.includerestricted.oai = false' option in the dspace.cfg file. I guess there's a bug in 1.5.2 that's fixed in 1.6. All I did was comment out the line and the harvesting now works fine."
davidWalker
 
Posts: 6
Joined: Wed Jan 06, 2010 2:24 pm


Return to Open Harvester Systems Support and Development

Who is online

Users browsing this forum: Google [Bot] and 2 guests