by asmecher » Mon Oct 29, 2007 9:31 am
Hi sarangi,
Harvester supports multiple metadata formats as plugins; if you harvest Arxiv using, for example, Dublin Core, the PDF isn't included, although a URL to the PDF should be. To get Harvester to fetch the PDFs as part of the harvesting process, I'd suggest writing a preprocessor plugin.
Regards,
Alec Smecher
Public Knowledge Project Team