I submitted one of our OJS-based e-journals (PCSP - http://pcsp.libraries.rutgers.edu/
) to Google Scholar a few weeks ago. Yesterday, I got an e-mail from the Google Scholar people, with the following:
If the documents on your site are freely available to everyone, all we need to do is to make sure that our crawlers can reach all the documents – i.e., all the content should be reachable by following links. If you have tables of content or a browsing interface that allows users to reach all documents by following links, then your site is already suitable for crawling. If on the other hand, the documents on your site are available only by searching via a search form, we will need to work together to make sure our crawlers can reach the content. In your case it would be great if you could create a simple link-based browsing interface for archives instead of the menu that you have in place right now. Menus can be tricky at times for crawlers. Do you think you could implement these changes?
I can send guidelines for this over email and we can follow that up with discussion/questions if needed.
If the documents on your site are available only to some people (possibly members of the campus community), we can work with you to index full text content while preserving your access-control regimen. For this, our crawlers will need access to the full text of the documents. This can be usually be implemented by an IP-based access control mechanism (crawlers are unable to handle passwords).
We also require that users that click on links to restricted-access documents be able to see at least an abstract.
Now the problem with OJS might be that the standard Archives page uses a drop-down list for the older issues. This means that the Google crawler won't be able to find the articles. So now I am working on a single page that generates a list of all the articles, authors, links to abstract pages, etc. by doing a DB query.
With the OJS-based journal Nordlyd (http://www.ub.uit.no/munin/nordlyd/index.php?locale=en
) they seem to have solved this problem by generating a list of links to the most recent issues on the frontpage. I remember that I saw one other journal where they got rid of the standard archives page, but I forgot the title of this journal.