MBooks, revisited

This posting makes available a stylesheet to render MARCXML from a collection of records called MBooks.

In a previous post — get-mbooks.pl — I described how to use OAI-PMH to harvest MARC records from the MBooks project. The program works; it does what it is suppose to do.

The MBooks collection is growing so I harvested the content again, but this time I wanted to index it. Using an indexer/search engine called Zebra, the process was almost trivial. (See “Getting Started With Zebra” for details.)

Since Zebra supports SRU (Search/Retrieve via URL) out of the box, searches against the index return MARCXML. This will be a common returned XML stream for a while, so I needed to write an XSLT stylesheet to render the output. Thus, mbooks.xsl was born.

What is really “kewl” about the stylesheet is the simple inline Javascript allowing the librarian to view the MARC tags in all their glory. For a little while you can see how this all fits together in a simple interface to the index.

Use mbooks.xsl as you see fit, but remember “Give back to the ‘Net.”

Tags: ,

3 Responses to “MBooks, revisited”

  1. […] MBooks, revisited / Eric Lease Morgan – Infomotions Mini-Musings » Blog Archive (tags: XML xsl webservice bibliothèques library SRW/SRU MARC catalogue innovation javascript books numerisation search OAI-PMH OPAC) […]

  2. Jeffrey Beall says:

    We loaded the MARC records into our catalog. I am unable to find the title in your database that corresponds to the record below:

    http://skyline.cudenver.edu/record=b2221498

    I realize your database is an experiment, but I was having trouble finding things with it. It appears to index titles on the article.

    Thanks.

  3. Jeffrey, thank you for the comment, but I am unable to explain the discrepancy.

    I just finished harvesting content again, and I downloaded about 115,000 records. When I indexed the data, the indexer thought some of the data was invalid around the 113,000 record mark. (No puns intended.) Maybe your import routine accepted these extra records, and maybe the record mentioned above fell into that category. Strange.