OJS OCS OMP OHS

You are viewing the PKP Support Forum | PKP Home Wiki



Inject metadata in xml file to harvester2

Open Harvester Systems support questions and answers, bug reports, and development issues.

Moderators: jmacgreg, michael, John

Forum rules
Developer Resources:

Git: You can access our public Git Repository here. Comprehensive Git usage instructions are available on the wiki.

Bugzilla: You can access our Bugzilla report tracker here.

Search: You can use our Google Custom Search to search across our main website, the support forum, and Bugzilla.

Questions and discussion are welcome.

Inject metadata in xml file to harvester2

Postby obi » Tue Oct 14, 2008 5:30 am

Hi,
http://www.oaister.org/ offers OAIster metadata via ftp. This is accessible at:
ftp://ftp.umdl.umich.edu/pub/records/oaister0229[a-z].tar.gz. We would like to translate the format and inject them to a Harvester2 system. Has anyone done such a work or related work?

Cheers
Obi
obi
 
Posts: 48
Joined: Wed Jun 09, 2004 5:56 am

Re: Inject metadata in xml file to harvester2

Postby asmecher » Tue Oct 14, 2008 9:27 am

Hi Obi,

I'm not familiar with that OAIster service -- what do the archives look like? If it's OAI static data (i.e. XML files conforming to the OAI Static Repository spec), the current release of the harvester can indeed accept the files.

Regards,
Alec Smecher
Public Knowledge Project Team
asmecher
 
Posts: 8328
Joined: Wed Aug 10, 2005 12:56 pm

Re: Inject metadata in xml file to harvester2

Postby obi » Thu Oct 16, 2008 4:57 am

This is how the xml files look like when gunzip og untar (this is explained in http://www.oaister.org/sru.html):

<?xml version="1.0" encoding="UTF-8"?>
<BIBDB><GROUP NAME="zas">
<A ID="oai:ZASPIL:zp28001" DT="2005-09-08T13:31:30Z"><B><K>Acoustic Cues for the Korean Stop Contrast - Dialectal Variation</K><L>Choi, Hansook</L></B><E><YR>2002</YR><X>ZAS-Berlin</X></E><G><AA>http://www.zas.gwz-berlin.de/papers/zaspil/articles/28-1-choi.pdf</AA></G><J><URL>http://www.zas.gwz-berlin.de/papers/zaspil/articles/28-1-choi.pdf</URL></J><FMT>application/pdf</FMT><LANG>English</LANG><TYPE>arcticle</TYPE><INST>Zentrum fur Allgemeine Sprachwissenschaft, Typologie und Universalienforschung (ZAS) Archive</INST></A>
<A ID="oai:ZASPIL:zp29002" DT="2005-09-08T13:31:30Z"><B>..........</A>
........
</GROUP></BIBDB>

Harvesting Oister.org is only available via ftp. The metadata is packed in .tar.gz. I was thinking using DOM/PHP5 to inject the data in the xml files into Harvester2.

Currently Harvester2 uses a socket open and read to access remote HTTP. Would it simple to add an extension which allows the harvester to access ftp socket 21?

Cheers
Obi
obi
 
Posts: 48
Joined: Wed Jun 09, 2004 5:56 am

Re: Inject metadata in xml file to harvester2

Postby asmecher » Thu Oct 16, 2008 10:39 am

Hi Obi,

It should be possible to use URLs like ftp://hostname/dir in the Harvester as is; the trick will be getting the Harvester to recognize the data format you're describing, which is not currently supported (and doesn't look standard to me). You'd need to write a metadata format plugin (probably based off DC) to support it.

Regards,
Alec Smecher
Public Knowledge Project Team
asmecher
 
Posts: 8328
Joined: Wed Aug 10, 2005 12:56 pm


Return to Open Harvester Systems Support and Development

Who is online

Users browsing this forum: No registered users and 1 guest