March 28, 2007

Thesis types in repositories

Filed under: E-Theses and ETD conference,Harvesting — Neil Godfrey @ 6:53 am

Australian-NZ repositories need to be able to separate their ADT material into a single Set for ADT harvesting.

Repository Metadata for thesis so far is generally in the following places:

Marc: 502

DC: type or DC:description

The ETD standard metadata is to use and qualifiers.

It’s probably a good idea to have ETD-MS in a thesis record for more than just ADT purposes. And if ETD-MS is in there, then ideally that could inform the RDF of the types of theses that are required for the ADT Set.

But how to get the ETD-MS data?

Most 502 MARC fields I know enter the generic “Thesis” instead of the name of the Thesis in the first part.

502 $a Thesis (PhD) — USQ, 2005

Can we get a program to read a MARCXML 502 field to translate the (PhD) part to be turned into “Doctor of Philosophy” as the value for the element, and then again into “doctoral” for the element?

What work would be involved? Thinking of additional namespaces and things in the data.

March 19, 2007

Harvesting SETS in repositories

Filed under: E-Theses and ETD conference,Harvesting,Repositories — Neil Godfrey @ 5:51 am

So DSpace and EPrints can generate Sets to enable selective harvesting of their records. So one can limit harvest commands to surveying just their thesis collection, or just their preprints, or just one particular subject code.

And that’s just like OAI-PMH says they should.

So why can’t the VITAL repository do the same? I asked our programmer here about it and was informed that Fedora is not set up to handle OAI Sets. That was a bit of a jaw-dropper. I’m a non-techie and had understood from all the techies told me that the Fedora base is the best every way. But if it cannot handle Sets then that’s a very huge argument for someone to opt for a non-Fedora DSpace or Eprints repository.

But its crazy that a major repository system cannot handle sets. Sets are surely one of the most fundamental reasons one wants a repository in the first place.

Anyway, our programmer is now looking to see how the RDF code in Fedora can be configured to talk with OAI commands. Apparently needs a special download from the Fedora site (an oaiprovider zip file).

Sounds promising. But why the hell hasn’t this happened before now with the VITAL product? Looking forward to bringing it up to speed here at RUBRIC. Should be good.

March 7, 2007