<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Metalogger</title>
	<atom:link href="http://metalogger.wordpress.com/feed/" rel="self" type="application/rss+xml" />
	<link>http://metalogger.wordpress.com</link>
	<description>Neil Godfrey's passing thoughts and queries as a Metadata Specialist (RUBRIC), Repository Coordinator (Murdoch University), and Principal Librarian (Singapore National Library Board)</description>
	<lastBuildDate>Thu, 17 Sep 2009 04:15:42 +0000</lastBuildDate>
	<generator>http://wordpress.com/</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<cloud domain='metalogger.wordpress.com' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://www.gravatar.com/blavatar/6027799bb3a11ac547984671fd3a28cb?s=96&#038;d=http://s.wordpress.com/i/buttonw-com.png</url>
		<title>Metalogger</title>
		<link>http://metalogger.wordpress.com</link>
	</image>
			<item>
		<title>Last post</title>
		<link>http://metalogger.wordpress.com/2009/09/17/last-post/</link>
		<comments>http://metalogger.wordpress.com/2009/09/17/last-post/#comments</comments>
		<pubDate>Thu, 17 Sep 2009 04:14:27 +0000</pubDate>
		<dc:creator>neilgodfrey</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://metalogger.wordpress.com/2009/09/17/last-post/</guid>
		<description><![CDATA[It&#8217;s a jolly darn shame but unfortunately I cannot see myself in a position to make any more posts here to this blog. It was a happy adventure while it lasted. 
Free for other discussion on metadata issues, though, via neilgodfrey1[AT]gmail.com
       <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=metalogger.wordpress.com&blog=559463&post=343&subd=metalogger&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>It&#8217;s a jolly darn shame but unfortunately I cannot see myself in a position to make any more posts here to this blog. It was a happy adventure while it lasted. </p>
<p>Free for other discussion on metadata issues, though, via neilgodfrey1[AT]gmail.com</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/metalogger.wordpress.com/343/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/metalogger.wordpress.com/343/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/metalogger.wordpress.com/343/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/metalogger.wordpress.com/343/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/metalogger.wordpress.com/343/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/metalogger.wordpress.com/343/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/metalogger.wordpress.com/343/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/metalogger.wordpress.com/343/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/metalogger.wordpress.com/343/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/metalogger.wordpress.com/343/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=metalogger.wordpress.com&blog=559463&post=343&subd=metalogger&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://metalogger.wordpress.com/2009/09/17/last-post/feed/</wfw:commentRss>
		<slash:comments>6</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/9eb80d11dd33a3b625e2b0a1204fdb45?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">neilgodfrey</media:title>
		</media:content>
	</item>
		<item>
		<title>Maintaining the distinction between metadata and content files : datasets</title>
		<link>http://metalogger.wordpress.com/2009/06/12/blurring-the-distinction-between-metadata-and-content-files-datasets/</link>
		<comments>http://metalogger.wordpress.com/2009/06/12/blurring-the-distinction-between-metadata-and-content-files-datasets/#comments</comments>
		<pubDate>Fri, 12 Jun 2009 11:00:42 +0000</pubDate>
		<dc:creator>neilgodfrey</dc:creator>
				<category><![CDATA[Metadata]]></category>
		<category><![CDATA[CDWA]]></category>
		<category><![CDATA[cultural heritage]]></category>
		<category><![CDATA[dataset]]></category>
		<category><![CDATA[datasets]]></category>
		<category><![CDATA[EAD]]></category>
		<category><![CDATA[ISAD]]></category>
		<category><![CDATA[ISAD(G)]]></category>
		<category><![CDATA[MODS]]></category>

		<guid isPermaLink="false">http://metalogger.wordpress.com/?p=323</guid>
		<description><![CDATA[Case study: an archival body responsible for cultural preservation
Case study: An archival body has collated text documents, photographs, plans, and more to assist with the preservation of national heritage buildings and monuments. They would now like to have these digitized and stored in a library and generally made publicly accessible.
This gives rise to some interesting [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=metalogger.wordpress.com&blog=559463&post=323&subd=metalogger&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><h3>Case study: an archival body responsible for cultural preservation</h3>
<p>Case study: An archival body has collated text documents, photographs, plans, and more to assist with the preservation of national heritage buildings and monuments. They would now like to have these digitized and stored in a library and generally made publicly accessible.</p>
<p>This gives rise to some interesting conceptual nuances in figuring out the best way to structure the data in a library (repository) record.</p>
<p><strong>Textual and pictorial metadata</strong></p>
<p>The archiving agency has a text form that they fill in to accompany the other photographs, volumes, plans etc. But this text form once filled in is not metadata about those accompanying photographs, volumes, plans, etc. The text form contains information about the history, ownership, occupancy, style, architectural and structural details, gazette info, original architects, builders, etc etc about the monument or building.</p>
<p>So all the data that is held by the archiving agency &#8212; text forms, photographs, etc &#8211;&nbsp; is data about the real building or monument &#8220;out there&#8221;, in the street somewhere.</p>
<p>In fact, what the archivists have is a combination of textual and pictorial metadata about real buildings.</p>
<p>The text information and the digitized volumes and photographs all describe the real monuments, either textually or pictorially. The “intellectual content” or “work” for preservation is the real building itself.</p>
<h3>The real target content</h3>
<p>The digitized graphic data and text data about the monuments are datasets. These datasets are sets of data about the real monuments. Although they are archival materials, they are not archival content in the same sense that an historic treaty or letter or diary are archival content in their own right. The archival agency here uses their data (both text and graphic) to inform them about the real monuments &#8220;out there&#8221;.</p>
<p>Compare a history book. A history book can be about events that happened in the real world, but the history book itself is the primary targeted content of a library repository. Users read it as intellectual content in its own right, and to engage with the ideas of the historian. This is not the same as with the above content used for the preservation of <a class="zem_slink" href="http://en.wikipedia.org/wiki/Cultural_heritage" title="Cultural heritage" rel="wikipedia">cultural heritage</a> buildings. In this case, the archived content can be said to be metadata informing users about, and offering tools for, the intellectual content of the cultural heritage in the real world.</p>
<p>The archived photographs, for example, are a form of metadata, that is pictorial metadata, about real monuments. These photographs are not the same as artistic or historic photographs that one might archive for their archival value in their own right. The archival photographs need to be archived or preserved as vital metadata about the real monuments – the real content of interest to which the photographic metadata points.</p>
<h3>Schema considered for handling this kind of dataset:</h3>
<h4>ISAD(G): General International Standard Archival Description and EAD: <a class="zem_slink" href="http://en.wikipedia.org/wiki/Encoded_Archival_Description" title="Encoded Archival Description" rel="wikipedia">Encoded Archival Description</a></h4>
<p>These schema appear at first glance to be the obvious choices for encoding archival data. The EAD has been developed from ISAD. But caution is to be advised:</p>
<ol>
<li>The structure of these schema is hierarchical. That is, it is built around Fonds and Series.&nbsp; This is perfectly fine for nondigital document filing. And it is also fine for digital storage systems that are prepared to “sit as is” in perpetuity. For local display purposes, here and now and the foreseeable future with minimal technological changes, this will probably work very well.</li>
<li>But hierarchical structures in metadata are fraught with difficulties. Hierarchical structures are rarely sustainable across different software platforms. They are also difficult to crosswalk with other metadata schema. (The EAD website does offer a crosswalk to MARC 21, but some of the MARC 21 fields used (e.g. 351 – for hierarchical information) are not common and are not easily mappable to other schema such as MODS or <a class="zem_slink" href="http://en.wikipedia.org/wiki/Dublin_Core" title="Dublin Core" rel="wikipedia">Dublin Core</a>.) For this reason alone EAD is not an ideal schema for long term preservation and interoperability purposes.</li>
<li>Hierarchical structures also work against <a class="zem_slink" href="http://en.wikipedia.org/wiki/Open_Archives_Initiative" title="Open Archives Initiative" rel="wikipedia">OAI</a> searching. The context of any hits can be lost if that information is at a different hierarchical level, with the result that the hit can appear to be either meaningless or misleading via an OAI or DC search.</li>
<li>For the above sorts of reasons, the trend in OAI and the digital library/repository communities is towards nonhierarchical, flat protocols. EAD may be fine for local usage and search and display purposes, but it has the limitation of being left behind in a silo, while our intent is to be sharing data as widely as possible according to our L2010 vision.</li>
<li>ISAD and EAD are designed for addressing collections where the primary content for direct preservation (archiving) are documents such as historical papers and images etc. As explained above, this is not the case with the PMB data. ISAD and EAD are not designed for datasets. Nor are they designed to meet the requirements of OAI protocol searches.</li>
</ol>
<h4>VRA: <a class="zem_slink" href="http://en.wikipedia.org/wiki/Visual_Resources_Association" title="Visual Resources Association" rel="wikipedia">Visual Resources Association</a> and TEI: <a class="zem_slink" href="http://en.wikipedia.org/wiki/Text_Encoding_Initiative" title="Text Encoding Initiative" rel="wikipedia">Text Encoding Initiative</a></h4>
<p>VRA also suffers from some of the limitations of EAD:</p>
<ol>
<li>VRA is designed for addressing collections where the primary content for direct preservation (archiving) are documents such as historical, cultural or artistic images etc. As explained above, this is not the case with the PMB data. VRA is not designed for datasets.</li>
<li>The properties supported by VRA are too limited to support the range of potential properties required by PMB.</li>
</ol>
<p>TEI is designed for manuscripts and large volumes of text.</p>
<h4>CDWA: <a class="zem_slink" href="http://en.wikipedia.org/wiki/Categories_for_the_Description_of_Works_of_Art" title="Categories for the Description of Works of Art" rel="wikipedia">Categories for the Description of Works of Art</a>, and CDWA Lite</h4>
<p>CDWA covers the many of the properties required for descriptive (and some preservation) metadata. It meets the basic descriptive requirements of PMB for their data. However, it also has limitations:</p>
<ol>
<li>I have doubts about how widely supported it would be by software platforms and search and display interfaces, at least without much local IT tweaking. And if systems could be massaged to support it now, there will remain the questions of future sustainability as software and hardware requirements and practices change.</li>
<li>CDWA Lite looks easier to use than the full CDWA, but unfortunately the reason it is “Lite” is because most of the properties that pertain to buildings and real-life monuments have been removed. It is more useful for museum and gallery artworks.</li>
</ol>
<h4>MODS: <a class="zem_slink" href="http://en.wikipedia.org/wiki/Metadata_Object_Description_Schema" title="Metadata Object Description Schema" rel="wikipedia">Metadata Object Description Schema</a></h4>
<p>MODS has the advantage of being able to express the CDWA and other properties specific to PMB requirements in a basically flat structure. It is also highly interoperable with other schema, including Dublin Core. It is an ideal base for creating OAI compliant DC for basic search. It is also very widely recognized and supported by the Library of Congress, and known well enough to be read and understood by most data service providers.</p>
<p>A major advantage of using MODS is that it will be the simplest way of leaving the door open for <a class="zem_slink" href="http://en.wikipedia.org/wiki/Open_Archives_Initiative_Object_Reuse_and_Exchange" title="Open Archives Initiative Object Reuse and Exchange" rel="wikipedia">OAI-ORE</a> as well as <a class="zem_slink" href="http://en.wikipedia.org/wiki/Open_Archives_Initiative_Protocol_for_Metadata_Harvesting" title="Open Archives Initiative Protocol for Metadata Harvesting" rel="wikipedia">OAI-PMH</a> searches and retrievals of data. If the controlled vocabularies used in the archival records and MODS properties are eventually compliant with KOS and RDF models, then semantic web interrogation of the database has the potential to explore and discover relationships and knowledge otherwise lost from view.</p>
<h3>Draft MODS-DC schema for data about cultural monuments, not about the monuments themselves</h3>
<p>Draft of MODS-DC schema for data about cultural heritage monuments, not about the monuments themselves:&nbsp; <a href="http://tinyurl.com/lcuhyy"><strong>http://tinyurl.com/lcuhyy</strong></a></p>
<div style="margin-top:10px;height:15px;" class="zemanta-pixie"><a class="zemanta-pixie-a" href="http://reblog.zemanta.com/zemified/ab039464-d252-4a1c-947c-e23b0ca6b672/" title="Reblog this post [with Zemanta]"><img style="border:medium none;float:right;" class="zemanta-pixie-img" src="http://img.zemanta.com/reblog_e.png?x-id=ab039464-d252-4a1c-947c-e23b0ca6b672" alt="Reblog this post [with Zemanta]"></a></div>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/metalogger.wordpress.com/323/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/metalogger.wordpress.com/323/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/metalogger.wordpress.com/323/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/metalogger.wordpress.com/323/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/metalogger.wordpress.com/323/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/metalogger.wordpress.com/323/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/metalogger.wordpress.com/323/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/metalogger.wordpress.com/323/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/metalogger.wordpress.com/323/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/metalogger.wordpress.com/323/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=metalogger.wordpress.com&blog=559463&post=323&subd=metalogger&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://metalogger.wordpress.com/2009/06/12/blurring-the-distinction-between-metadata-and-content-files-datasets/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/9eb80d11dd33a3b625e2b0a1204fdb45?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">neilgodfrey</media:title>
		</media:content>

		<media:content url="http://img.zemanta.com/reblog_e.png?x-id=ab039464-d252-4a1c-947c-e23b0ca6b672" medium="image">
			<media:title type="html">Reblog this post [with Zemanta]</media:title>
		</media:content>
	</item>
		<item>
		<title>Not defunct, not yet: and afterthoughts on some basics for a repository&#8217;s early-days development</title>
		<link>http://metalogger.wordpress.com/2009/05/21/not-defunct-not-yet-and-afterthoughts-on-some-basics-for-a-repositorys-early-days-development/</link>
		<comments>http://metalogger.wordpress.com/2009/05/21/not-defunct-not-yet-and-afterthoughts-on-some-basics-for-a-repositorys-early-days-development/#comments</comments>
		<pubDate>Thu, 21 May 2009 11:26:06 +0000</pubDate>
		<dc:creator>neilgodfrey</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://metalogger.wordpress.com/?p=319</guid>
		<description><![CDATA[Hoo boy. I&#8217;ve been away from blogging here for so long I feel embarrassed to return. Glad to see that my posts from my previous life apparently continue to be of some use to others. I&#8217;m in such a totally different world now, however, that I wonder if my ongoing experiences will relate to many [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=metalogger.wordpress.com&blog=559463&post=319&subd=metalogger&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>Hoo boy. I&#8217;ve been away from blogging here for so long I feel embarrassed to return. Glad to see that my posts from my previous life apparently continue to be of some use to others. I&#8217;m in such a totally different world now, however, that I wonder if my ongoing experiences will relate to many in the areas I&#8217;m most used to &#8212; building repositories in academic environments. Will see . . . . Most of what I have been engrossed in has been constructing application profiles and crosswalks for specialist types of resources (e.g. gi-normous datasets of newspaper articles, national heritage buildings, et al) and preparing Singapore Framework RDF based profiles and team management and education programs, and advising for anything from improved tools and workflows to semantic web directions, social tagging applications and a KOS (knowledge organisation system). Challenging stuff, not least the politics of it all. And cultural factors need negotiation, too.</p>
<p>Maybe I should start new posts here with a few more basics, like some implications of RDA changes for DC based schema or application profiles.</p>
<h3>Building supports</h3>
<p>I keep thinking of one thing I never got to discuss here, and that I regretted not staying long enough at my last university library to see through as far as I had wanted. That is the primary need for building institutional support for a repository that is expected to last the long haul.</p>
<p>Even in a more established setup where I am now the critical importance of this factor is all too obvious and all too easily relegated to secondary efforts. At the university where I was last year, as soon as we had sorted out the basics of our repository software (that is, actually got it working, well almost) and had something to show, the first priority was to invite and involve academic representatives from the different faculties (zeroing in on those few &#8212; even one if that&#8217;s all there is &#8212; with a real personal interest in the OA project), reps from the research reporting/admin department, and from the university governing bodies, and especially a key person from the IT support section. And some librarian liaisons, too.</p>
<p>At our first meeting we demonstrated what was being done, presented the benefits, some of the handicaps we faced, and asked for support &#8212; with inputs from these stakeholders into policy and objectives of the repository. As well as ongoing monitoring and finetuning of these policies and detailed objectives , with a view to the repository thus becoming an integral part of the larger institution&#8217;s self-identity, its face to the world, its internal workflows and consciousness.</p>
<h3>And getting first things first</h3>
<p>Project planning sounds like an obvious need, but I&#8217;ve seen places where it is only given slipshod attention, and I&#8217;ve also seen the very sticky consequences of that neglect. Not that the personnel involved are slack at all. They just have different talents and do what they do best. Hiring a project planner where this is the situation is a necessity, I think.</p>
<p>And then the second basic requirement is planning and working in accordance with known standards and best practice. Forget radical innovation &#8212; at least until after the basics are established and there&#8217;s a bit of a track record with working with a &#8220;basic&#8221; repository and workflow setup.</p>
<p>It&#8217;s much easier to do the boring bits first and involve stakeholders in negotiation to clearly define ones &#8220;designated community&#8221; of users, and the purposes of the repository and how it will be used by whom.</p>
<p>By doing this &#8220;obvious&#8221; stuff first so many questions along the way will become easy to answer. One can spend an awful lot of time and expense trying to design workflows and architecture models and working out what metadata to include in what places, etc etc. But having established from the very get-go exactly who one&#8217;s user group/s are to be, how and for what purposes they are to use the repository, and general policies to set the direction and purposes of the whole show, the time and stress on working out many of those metadata and workflow/architecture basics will just fall into place. The guidelines will be there to make finding the answers relatively easy. Without those prepared-in-advance guidelines one will be pretty much be mostly guessing about what to do next, no matter how often one attempts to rationalize those guesses.</p>
<p>No wonder that the ISO standard, the preservation repository OAIS reference model, mandates that one first document one&#8217;s designated community, negotiate policies and purposes with stakeholders, and how and for whom/why the repository&#8217;s resources are to be used and preserved.</p>
<p>Many academic libraries have a huge advantage over some national or state libraries in the world. Those libraries that are constantly being starved for funds are pretty much forced to find ways to cooperate with peers, to share experiences and resources and ideas, and to make the most cost-effective decisions. Too much $$$ and one major incentive to cooperate and learn from others can wilt. Not good.</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/metalogger.wordpress.com/319/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/metalogger.wordpress.com/319/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/metalogger.wordpress.com/319/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/metalogger.wordpress.com/319/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/metalogger.wordpress.com/319/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/metalogger.wordpress.com/319/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/metalogger.wordpress.com/319/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/metalogger.wordpress.com/319/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/metalogger.wordpress.com/319/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/metalogger.wordpress.com/319/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=metalogger.wordpress.com&blog=559463&post=319&subd=metalogger&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://metalogger.wordpress.com/2009/05/21/not-defunct-not-yet-and-afterthoughts-on-some-basics-for-a-repositorys-early-days-development/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/9eb80d11dd33a3b625e2b0a1204fdb45?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">neilgodfrey</media:title>
		</media:content>
	</item>
		<item>
		<title>MACAR focus and other possibilities?</title>
		<link>http://metalogger.wordpress.com/2009/03/25/macar-focus-and-other-possibilities/</link>
		<comments>http://metalogger.wordpress.com/2009/03/25/macar-focus-and-other-possibilities/#comments</comments>
		<pubDate>Wed, 25 Mar 2009 04:09:10 +0000</pubDate>
		<dc:creator>neilgodfrey</dc:creator>
				<category><![CDATA[Metadata]]></category>

		<guid isPermaLink="false">http://metalogger.wordpress.com/?p=314</guid>
		<description><![CDATA[Since moving to the National and Public library scene the extent to which MACAR&#8217;s focus has been on the sorts of resources housed in academic repositories has become very apparent.  MACAR included non-academic library representatives but they were outnumbered by those from the academic scene.  Now I have joined the National Library Board [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=metalogger.wordpress.com&blog=559463&post=314&subd=metalogger&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>Since moving to the National and Public library scene the extent to which <a href="http://macar.wikidot.com/">MACAR</a>&#8217;s focus has been on the sorts of resources housed in academic repositories has become very apparent.  MACAR included non-academic library representatives but they were outnumbered by those from the academic scene.  Now I have joined the <a href="http://www.nlb.gov.sg/page/Corporate_portal_page_home">National Library Board of Singapore</a> (NLB) I have had to face up to just how limited in focus MACAR&#8217;s past work has been. On the other hand, maybe progress best kicks off when it is made a small step at a time.</p>
<p>At the NLB a preliminary list of resource types for the national and public library sectors has been prepared. It contains over 50 resource type terms. Some of those will almost certainly be rationalized, but it is clear a list of resource types in a national library is longer than the current MACAR resource type list. And such a list is only the beginning of working towards best practice OAI (ORE?) metadata applications.</p>
<p>With some of the MACAR members now working with <a href="http://www.ands.org.au/">ANDS</a>, and MACAR coming under the umbrella of <a href="http://www.caul.edu.au/">CAUL</a>, one might expect MACAR to be entrenched further with research and academic library requirements.  I wonder if there are any realistic chances of national libraries cooperating with each other and even with academic libraries, and if there are real benefits to be gained by their doing so. Since moving into the national and public library sector I think there are real potentials for users if it could happen. To expose and share the specialist heritage and wider cultural and digitized newspaper collections that such libraries house, especially alongside libraries dedicated to research and education, can only be a Good Thing.</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/metalogger.wordpress.com/314/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/metalogger.wordpress.com/314/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/metalogger.wordpress.com/314/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/metalogger.wordpress.com/314/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/metalogger.wordpress.com/314/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/metalogger.wordpress.com/314/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/metalogger.wordpress.com/314/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/metalogger.wordpress.com/314/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/metalogger.wordpress.com/314/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/metalogger.wordpress.com/314/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=metalogger.wordpress.com&blog=559463&post=314&subd=metalogger&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://metalogger.wordpress.com/2009/03/25/macar-focus-and-other-possibilities/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/9eb80d11dd33a3b625e2b0a1204fdb45?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">neilgodfrey</media:title>
		</media:content>
	</item>
		<item>
		<title>back again &#8212; first to continue a discussion on oai and dc</title>
		<link>http://metalogger.wordpress.com/2008/12/10/back-again-first-to-continue-a-discussion-on-oai-and-dc/</link>
		<comments>http://metalogger.wordpress.com/2008/12/10/back-again-first-to-continue-a-discussion-on-oai-and-dc/#comments</comments>
		<pubDate>Wed, 10 Dec 2008 12:41:59 +0000</pubDate>
		<dc:creator>neilgodfrey</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://metalogger.wordpress.com/2008/12/10/back-again-first-to-continue-a-discussion-on-oai-and-dc/</guid>
		<description><![CDATA[Finally getting back to touch this blog at last! Had been waylaid with intense last minute efforts to finish off a project in Perth before moving and settling in to a new job in Singapore (while still half keeping in touch with Perth).
My first back to the front posts are not posts but comments I [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=metalogger.wordpress.com&blog=559463&post=304&subd=metalogger&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>Finally getting back to touch this blog at last! Had been waylaid with intense last minute efforts to finish off a project in Perth before moving and settling in to a new job in Singapore (while still half keeping in touch with Perth).</p>
<p>My first back to the front posts are not posts but comments I have added in response to another by Diane Hillman that addressed a discussion re <a href="http://metalogger.wordpress.com/2008/10/10/are-repositories-set-to-be-left-out-in-the-cold/">Are Repositories Set to be Left Out in the Cold?</a> (or at least maybe set for a bit of a reconfig one day?) &#8212; <em>DH &#8212; are you still around to respond?</em></p>
<p>Also a pointer to a correction of something I had misunderstood about literals/nonliterals. . . </p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/metalogger.wordpress.com/304/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/metalogger.wordpress.com/304/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/metalogger.wordpress.com/304/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/metalogger.wordpress.com/304/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/metalogger.wordpress.com/304/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/metalogger.wordpress.com/304/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/metalogger.wordpress.com/304/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/metalogger.wordpress.com/304/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/metalogger.wordpress.com/304/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/metalogger.wordpress.com/304/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=metalogger.wordpress.com&blog=559463&post=304&subd=metalogger&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://metalogger.wordpress.com/2008/12/10/back-again-first-to-continue-a-discussion-on-oai-and-dc/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/9eb80d11dd33a3b625e2b0a1204fdb45?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">neilgodfrey</media:title>
		</media:content>
	</item>
		<item>
		<title>Repository-publishing-reporting workflow integration</title>
		<link>http://metalogger.wordpress.com/2008/11/21/repository-publishing-reporting-workflow-integration/</link>
		<comments>http://metalogger.wordpress.com/2008/11/21/repository-publishing-reporting-workflow-integration/#comments</comments>
		<pubDate>Fri, 21 Nov 2008 03:24:46 +0000</pubDate>
		<dc:creator>neilgodfrey</dc:creator>
				<category><![CDATA[Repositories]]></category>

		<guid isPermaLink="false">http://metalogger.wordpress.com/?p=295</guid>
		<description><![CDATA[Having attempted a few times on http://techessence.info/node/104 to register to add a comment, without success, to Roy Tennant’s referral to my repository comparison, I am left to make one comment here.
Roy rightly comments that the picture is bigger than I have indicated here (my blog comparison is in fact a truncated rump of something I [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=metalogger.wordpress.com&blog=559463&post=295&subd=metalogger&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>Having attempted a few times on <a rel="nofollow" href="http://techessence.info/node/104">http://techessence.info/node/104</a> to register to add a comment, without success, to Roy Tennant’s referral to my repository comparison, I am left to make one comment here.</p>
<p>Roy rightly comments that the picture is bigger than I have indicated here (my blog comparison is in fact a truncated rump of something I prepared for another institution to assist them to investigate one specific issue), but I would suggest that his illustration of Digital Commons extra capabilities vis a vis DSpace is one example of many factors that seem to me to cloud decision making among some institutions.</p>
<p>A peer view/publishing system tied in with a repository in order to save on numbers of finger clicks to process publishing and reporting and other administrative work is a good thing.</p>
<p>But a scholarly publishing workflow system does not have to be the preserve of a single enterprise solution. One example from Australia: the Integrated Content Environment for Research and Scholarship (ICE-RS) is a Federal Government funded project to create a collaborative authoring and publishing (cum repository deposit!) tool that is open-source, and capable of integration with repository systems. It does not handle peer-review, though there are thoughts afloat for integrating ICE with the <a href="http://pkp.sfu.ca/?q=ojs">Public Knowledge Project&#8217;s Open Journal Systems</a> software for peer review.</p>
<p>Moreover, the same number of clicks for authors can be reduced for repository ingest purposes where there can be collaboration and integration with a university’s research office and reporting system. Some universities have been able to have current papers (including preprints) deposited in their repositories directly from a research system. The University of Sydney has a workflow line between their <a href="http://www.usyd.edu.au/ro/herdc/irma.shtml">IRMA research reporting system</a> and their DSpace repository. Murdoch University is engaged in establishing a similar workflow between their IRMA system and Fedora repository.</p>
<p>Further, Roy’s allusion to the “highly popular DSpace” repositories may be seen as further testimony to the success and sustainability of open-source solutions in a world of highly competitive and highly expensive enterprise products.</p>
<blockquote><p>M<strong>ore on ICE-RS:</strong></p>
<p><a rel="nofollow" href="http://www.dest.gov.au/Ministers/Media/Bishop/2006/07/B001310706.asp">Australian govt archive news release</a></p>
<p><a rel="nofollow" href="http://adlaustralia.org/idea2006/presentations/ice-presentation/ice.slide.htm">Slide presentation</a></p>
<p><a rel="nofollow" href="http://ice.usq.edu.au/introduction/about.htm">ICE: Integrated Content Management</a></p>
<p><a rel="nofollow" href="http://eprints.usq.edu.au/2653/1/Sefton_etd_2007.pdf">Integrated approach to preparing, publishing, presenting and preserving theses</a></p>
<p><a rel="nofollow" href="http://ask-oss.mq.edu.au/index.php?option=com_content&amp;task=view&amp;id=42&amp;Itemid=69">ASK-OSS post — courseware publishing system</a></p>
<p><a rel="nofollow" href="http://ptsefton.com/blog/2007/08/10/09-25-10.681066/">Why ICE works</a></p>
<p><a rel="nofollow" href="http://ice.usq.edu.au/blog/">ICE developers blog</a></p>
<p>More on Dr Peter Sefton’s <a rel="nofollow" href="http://ptsefton.com/">blog</a></p></blockquote>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/metalogger.wordpress.com/295/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/metalogger.wordpress.com/295/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/metalogger.wordpress.com/295/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/metalogger.wordpress.com/295/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/metalogger.wordpress.com/295/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/metalogger.wordpress.com/295/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/metalogger.wordpress.com/295/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/metalogger.wordpress.com/295/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/metalogger.wordpress.com/295/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/metalogger.wordpress.com/295/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=metalogger.wordpress.com&blog=559463&post=295&subd=metalogger&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://metalogger.wordpress.com/2008/11/21/repository-publishing-reporting-workflow-integration/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/9eb80d11dd33a3b625e2b0a1204fdb45?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">neilgodfrey</media:title>
		</media:content>
	</item>
		<item>
		<title>INFORMAL comparison of some institutional repository solutions</title>
		<link>http://metalogger.wordpress.com/2008/10/19/comparing-some-institutional-repository-solutions/</link>
		<comments>http://metalogger.wordpress.com/2008/10/19/comparing-some-institutional-repository-solutions/#comments</comments>
		<pubDate>Sun, 19 Oct 2008 04:47:57 +0000</pubDate>
		<dc:creator>neilgodfrey</dc:creator>
				<category><![CDATA[Repositories]]></category>

		<guid isPermaLink="false">http://metalogger.wordpress.com/?p=279</guid>
		<description><![CDATA[Over the last few years I have worked closely with a number of different institutional repository solutions, both open-source and enterprise products. There are several I have not had personal experience with, but I have taken opportunities to speak with a wide number of users of these products, too, as well as with representatives and [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=metalogger.wordpress.com&blog=559463&post=279&subd=metalogger&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>Over the last few years I have worked closely with a number of different institutional repository solutions, both open-source and enterprise products. There are several I have not had personal experience with, but I have taken opportunities to speak with a wide number of users of these products, too, as well as with representatives and producers of those solutions. I have also sought input from other users of repositories I am personally familiar with in an attempt to balance out my own personal impressions. The following comparison is based on feedback primarily from managers of the systems &#8212; whether they have live production systems or have done extensive testing on systems they expect to take live soon.</p>
<p><strong>The purpose of this comparison</strong> is to give an intro level guideline for institutions interested in &#8220;what (else) is out there&#8221;.</p>
<ul>
<li>The comparisons are not a systematic point-by-point balanced presentation. Anyone interested in a serious in-depth comparison or study of any particular repository solution would need to speak to other users themselves, as well as the producers or agents of the solutions.</li>
<li>It is also restricted to the repositories I know from my experiences in Australia.</li>
<li>I have not referenced costs or specific institutions here.</li>
<li>Nor have I attempted a serious comparison of the IT architecture across the systems.</li>
<li>The main focus is on support provided/needed and functionality of each product.</li>
</ul>
<h3><span style="color:#ff0000;">Digital Commons</span></h3>
<p><a href="http://www.bepress.com/ir/">http://www.bepress.com/ir/</a></p>
<p>Digital Commons is a &#8220;presentation repository&#8221;, not a &#8220;preservation repository&#8221;. Emphasis in a repository designed primarily to showcase an institution&#8217;s research is on an attractive and compelling interface for users, including self-submitters. Digital Commons is a hosted solution (i.e. hosted in California). There is no hardware to purchase, install or maintain. An institution can begin to upload papers immediately after installation. Purchase and maintenance is on a renewable one year or limited number of years basis.</p>
<p>Digital Commons cannot be synchronized with another preservation repository for migration purposes. A preservation repository, unlike Digital Commons, will record and preserve authentication, versioning, rights, structural and descriptive metadata. In Digital Commons such data will not be preserved for migration/exit strategy purposes to a preservation repository.</p>
<p><strong>Support</strong></p>
<ul>
<li>Three universities in Australia using Digital Commons reported that the service from BePress is &#8220;very good&#8221;.</li>
<li>The setup period consists of about 3 weeks. All report that Digital Commons is easy to set up.</li>
<li>Phone hook-ups were used for training and instruction at the beginning. Web demonstrations accompanied these.</li>
<li>Requests by a university to change the front page appearance were responded to quickly and changes made efficiently.</li>
<li>Another institution has requested many ‘fine tuning&#8217; modifications to their instance of Digital Commons, and all these requests have been met &#8220;pretty quickly&#8221;. One institution wanted the format of citations changed and BePress effected this change quickly for them.</li>
<li>One institution who has used Digital Commons for more than 2 years said they had never had any down-time with it.</li>
<li>Nightly uploads of new documents.</li>
</ul>
<p><strong>Functionality</strong></p>
<p><strong><em> </em></strong></p>
<ul type="disc">
<li>Documents can      be set to open or closed access</li>
<li>Different      authentications can be set up for different users</li>
<li>Self submission      is possible</li>
<li>Workflow stages      can be configured &#8220;to some extent&#8221;, so that a central library service can      monitor self-submitted documents for quality control and copyright issues</li>
<li>Embargo      functionality</li>
<li>Different types      of media files can be deposited (e.g. mp3, pdf, video)</li>
<li>OAI harvesting</li>
<li>PRPs (Personal      Researcher Pages) &#8211; this was a strong selling point at one institution. In      addition to the central epubs repository, links can take one to a PRP of      an author, and this PRP can contain a list of their publications, be used      as their homepage, and a point from which to access their documents. The      library instance of Digital Commons &#8220;harvests&#8221; these PRPs and includes      links back to the PRPs on the document pages.</li>
<li>Documents      organized by collections</li>
<li>Able to hide      preparatory work on a document being uploaded until it is ready to go      live.</li>
<li>Reporting and      statistics</li>
</ul>
<h3><span style="color:#ff0000;">DigiTool</span></h3>
<p><a href="http://www.exlibrisgroup.com/category/DigiToolOverview">http://www.exlibrisgroup.com/category/DigiToolOverview</a></p>
<p>An Ex Libris product, DigiTool is a Digital Asset Management System (DAMS). It is designed primarily for teaching functions, and its repository capacities consist of a set of additional modules. The DAMS is primarily designed for teachers to share their digital objects (images, course notes and notices, indexes of resources, exam papers etc.). Much of this data is ephemeral.</p>
<p>DigiTool is not a hosted solution, but there is a community of users, a consortia client of Ex Libris, who do support a &#8220;hosted server&#8221; &#8211; UNILINC: <a href="http://www.unilinc.edu.au/services/hosting.html">http://www.unilinc.edu.au/services/hosting.html</a></p>
<p><strong>Support</strong></p>
<p>Users of DigiTool report that it definitely requires their own local IT support to configure it appropriately for specific institutional needs.</p>
<p><strong>Functionality</strong></p>
<ul>
<li>Functionality depends on the modules purchased. Modules are available for:</li>
</ul>
<blockquote>
<ul>
<li>Academic self-submission</li>
<li>another for collection management (for      arranging objects, adding thumbnails and descriptive metadata)</li>
<li>a JPEG 2000 viewer as an optional plugin</li>
<li>OAI interoperability (harvesting) module</li>
</ul>
</blockquote>
<ul>
<li>The Ex Libris demo of DigiTool says it is scalable, and users of DigiTool all spoke of it being able to do much more than their immediate requirements.</li>
<li>Ex Libris also advertises that it &#8220;supports interoperability through open architecture&#8221;.</li>
<li>Embargo periods</li>
<li>Self-submission via the Deposit Module. The Deposit Module provides an interface and workflow which enables submission of objects and metadata by non-staff users.</li>
<li>Workflow also allows for authorized staff to control, edit, and approve/decline the submitted material.</li>
<li>Different levels of authentication: user/patron authentication is handled by the Local User management function or via LDAP.</li>
<li>Copyright: this can be managed by manual assignment of access rights to the object.</li>
<li>Objects can be assigned with access rights permissions.</li>
<li>The following formats are supported for load into DigiTool: MARCXML, DCXML, MODSXML, CSV, METS  &#8212; Given the claim that DigiTool is based on open architecture, one should expect the data stored would be migratable to other systems.</li>
<li>ExLibris advertises that DigiTool supports preservation standards such as PREMIS and the OAIS (Trusted Repositories) standard model.</li>
<li>DigiTool&#8217;s &#8220;interoperability module&#8221; for OAI harvesting does not configure OAI compliant Dublin Core. This is not a problem for harvesting by the NLA&#8217;s Discovery Service because DS have configured their service provider to read and harvest their DigiTool feeders. But seamless OAI harvesting cannot be guaranteed by other service providers. DigiTool users are expected to be informed of this issue by USQ-Repository Services. I have not been able to learn if this is unique to DigiTool or is also an issue with other proprietary solutions discussed in this report.</li>
<li>Some users see DigiTool&#8217;s deposit procedure as &#8220;klunky&#8221; in trying to get it to do what they want. Editing of objects can require hours (overnight) to take effect; citations need to be created separately since they are not automatically generated; and multiple key strokes are required for some &#8220;simple&#8221; operations such as moving an object from open to restricted access.</li>
<li>One user said that the upcoming version of DigiTool &#8220;promises&#8221; to be able to give them the ability to handle hierarchical structures. &#8220;We think it will do what we want.&#8221;</li>
<li>Citations need to be specifically created in DigiTool &#8211; they are not automatic as in EPrints.</li>
</ul>
<p><strong>Reasons for adoption</strong></p>
<p>Most institutions who have adopted it or who are considering doing so have said that their primary reason was to establish synchronicity with their other Ex Libris products. Some specifically added that it was policy for them to favour enterprise solutions over open-access solutions.</p>
<h3><span style="color:#ff0000;">DSpace</span></h3>
<p><a href="http://www.dspace.org/">http://www.dspace.org/</a> and <a href="http://libraries.mit.edu/dspace-mit/index.html">http://libraries.mit.edu/dspace-mit/index.html</a></p>
<p>DSpace is an open source solution developed by MIT. It has a large and active community of users. At least 450 registered DSpace repositories worldwide are evidence of DSpace&#8217;s robustness, ease of implementation, simplicity of maintenance and ongoing use, and low-cost.</p>
<p><strong>Support</strong></p>
<ul type="disc">
<li>A large and active community of      supporters with experience and expertise available to draw on</li>
<li>Thorough online documentation for IT      staff and managers for customization and implementation</li>
<li>Step by step online tutorials</li>
<li>Online assistance</li>
<li>The amount of local IT support required for the implementation of DSpace depends on the extent of configuration changes an institution wishes to make.</li>
<li>DSpace provides a module, Manakin, which enables the configuration of much more &#8220;original&#8221; interfaces without &#8220;intensive long term&#8221; IT support.</li>
<li>Institutions with basic largely &#8220;out of the box&#8221; configurations report that they can do &#8220;in the main&#8221; without local IT support. The payoff is that a few &#8220;minor issues&#8221; (e.g. maintaining correct indexing records when changing the location of an object from one collection to another) persist.</li>
</ul>
<p><strong>Functionality</strong></p>
<ul>
<li>DSpace manages objects in an hierarchical collections based structure. Collections (or hierarchies of collections) display alphabetically on the main page.</li>
<li>This Collections based organization, with inbuilt workflow and authentication capabilities, enables different faculties or departments to manage their own deposits and structure of their collections. Workflows can be set up to still provide for central quality control and final editing by the library.</li>
<li>Descriptive metadata for the objects has a flat structure, which means that in cases of objects with multiple authors from different affiliations, there is no automatic guarantee that data can be transferred intact from one repository to another. This requires IT support in order to set up, say, a METS package, in order to encapsulate the data in its original relationships for successful migration.</li>
<li>Workflows and authentications are supported.</li>
<li>Embargo periods are supported (metadata page displays but the attached document becomes public at a preset date)</li>
<li>Objects can be made inactive to be hidden from public view.</li>
<li>Different mime types are supported, including video and audio.</li>
<li>DSpace is integrated with Research Management systems in several universities.</li>
</ul>
<h3><span style="color:#ff0000;">EPrints</span></h3>
<p><a href="http://www.eprints.org/">http://www.eprints.org/</a></p>
<blockquote><p>EPrints is an open source solution developed and supported by the University of Southampton. EPrints is &#8220;easy to install, easy to configure, and needs minimal maintenance. Once installed, it simply works without fuss. Over a year, no maintenance has been required to the UTas server apart from updates.&#8221; (Arthur Sale, UTas)</p></blockquote>
<p>All EPrints administrators I have contacted have spoken well of its simplicity and stability. It is widely seen as an ideal repository solution for initial implementation in a university with limited financial resources and IT support.</p>
<blockquote><p>&#8220;EPrints is a mature software package, with an established community. It offers a complete solution for managing a research repository for Open Access. EPrints can be put to other uses, but for other uses such as image repositories alternative software might be more appropriate. . . . However, the software is under active development and it is particularly useful as an Open Access document repository.&#8221; (<a href="http://rubric.edu.au/repositories/eprints.htm">http://rubric.edu.au/repositories/eprints.htm</a>)</p></blockquote>
<p><strong>Support</strong></p>
<blockquote><p>&#8220;Many institutions do not have the resources necessary to build or maintain an institutional repository. The EPrints Services team offers a complete range of advice and consultancy to support institutions who have adopted, or who are looking to adopt, the EPrints solution. We can provide as much or as little support as you need to create and maintain a professional repository.&#8221; &#8211; EPrints site</p></blockquote>
<p>This assistance is gratis to those implementing and maintaining an EPrints repository.</p>
<p>I contacted at least half a dozen universities using EPrints and expressed the unqualified praise for the level and timeliness of support from Southampton. This praise came from both IT staff who have had to liaise with Southampton as well as from repository managers.</p>
<p>Patches and upgrades are released regularly. Users have remarked on the ease with which these are installed and the robustness of their maintenance.</p>
<p><strong>Functionality</strong></p>
<ul class="unIndentedList">
<li> EPrints supports OAI-PMH harvesting protocols.</li>
<li> Plug-ins have been developed so it can also support specific research reporting requirements and for supporting the emerging SWAP (Scholarly Works Application Profile) that is pioneering interoperability and semantic web developments for scholarly works.</li>
<li> Self-submission (with a simple self-submission interface that is quick and easy to learn) is supported.</li>
<li> Workflows can be configured for editors, submitters, and monitoring staff with different permissions.</li>
<li> Objects can be removed from open access.</li>
<li> Batch import (e.g. of ADT records) is supported.</li>
<li> Peer review status, publication status, copyright and other administrative information, and citation generation and statistics by objects and author are all part of the &#8220;out of the box&#8221; package.</li>
<li> EPrints supports text (in particular pdf) and image files, including multiple files per object.</li>
<li> Flat metadata structure. So when there are multiple authors with different affiliations there is no guarantee that the right author-affiliation matches will be maintained in future migrations to other repositories. Ensuring this requires some workarounds that are available, but need IT support to implement. In raising the flat metadata structure issue, it should also be mentioned that EPrints are developing an RDF module that converts their metadata into &#8220;triples&#8221; (subject-predicate-predicate). RDF (Resource Descriptive Framework) is the basis of the emerging Web 3.0 (Semantic Web) and enables data to be converted into multiple schema, including complex hierarchical structures.</li>
<li> EPrints has recently begun to support preservation metadata with the work of its Presev project and this has preservation functions have been implemented with EPrints 3:</li>
</ul>
<p style="padding-left:60px;">1 A history module to record changes to an object and actions performed on an object<br />
2 METS/DIDL plugins to package and disseminate data for delivery to external preservation services</p>
<ul class="unIndentedList">
<li> Slow indexing issues in EPrints have been rectified with the EPrints 3 version.</li>
</ul>
<p>EPrints development has ensured robust functionality and this has limited the file types supported in the earlier versions of the &#8220;out of the box&#8221; EPrints. But successive releases are allowing for wider variety of mime-types to be supported.</p>
<p>One limitation up till now (version 3.0) with EPrints has been the failure of the Embargo facility to publish objects on the release dates. These need to be manually published.</p>
<p>Dates for theses appear as &#8220;date published&#8221; instead of &#8220;date completed&#8221; in at least one institution. It is not clear to me if this is a configuration issue that is resolvable with IT and/or Southampton support.</p>
<p><strong>Users</strong></p>
<p>269 archives are known to be running EPrints archives worldwide.</p>
<blockquote><p>The reason for this recommendation was that it was &#8220;easy to set up, easy to configure, easy to use, and has a very large, open community supporting it and using it  . . . and not all that expensive .  .  . for small institutions such as these, it was ideal.&#8221; (USQ-RS Manager in personal email)</p></blockquote>
<h3><span style="color:#ff0000;">Equella</span></h3>
<p><a href="http://www.thelearningedge.com.au/">http://www.thelearningedge.com.au/</a></p>
<p>Equella is developed by The Learning Edge International (a Tasmanian based company).</p>
<p>Equella is primarily a &#8220;Learning Content Management System&#8221;. Learning Edge also describe it as a repository, but speak of it as a teaching repository tool. It is &#8220;a fully integrated Digital Repository and Content Authoring Tool&#8221;. It can be used as a collaborative lesson planning tool as well as a repository. It can plug into Blackboard and WebCT.</p>
<p>Users have described its administration module as well developed. &#8220;A non-IT person can set it up with a graphic user interface for collections and permissions and configuration.&#8221;</p>
<p><strong>Support</strong></p>
<ul>
<li>One institution needed &#8220;some&#8221; local IT support to set up Equella. They do not need local support for much more than activating periodic patches that Equella sends out now.</li>
<li>Another also said that the &#8220;setting up&#8221; of Equella was most complex part, but that this included the preparatory work. Equella is so very flexible that many decisions need to be made in advance about what exactly was wanted, what should be the best and proper policies. Once this work had been recorded it was relatively easy to setup. All the development work is done by Learning Edge.</li>
<li>One institution said that the initial setup involved two days&#8217; training, which was described as &#8220;very adequate&#8221;.</li>
</ul>
<p><strong>Functionality</strong></p>
<ul type="disc">
<li>self-submission      (including students of higher theses)</li>
<li>different      levels of authentication (easy to setup &#8211; a simple switch; manages various      levels of permissions among academics &#8211; very flexible)</li>
<li>workflow      systems (staged steps to check for copyright compliance with OAKLIST,      Sherpa, etc)</li>
<li>digital      rights management (can aggregate objects to groups for specific workflows      and permissions)</li>
<li>embargo      periods</li>
<li>OAI-PMH      harvesting (e.g. to Google Scholar)</li>
<li>records      can be pulled out of Research Master and sent back into Research Master</li>
<li>one      user noted that the user frontend is boring, uninspiring, but the      functionality behind it is &#8220;great fun&#8221; &#8211; relying on their own IT people to      soon rectify this web appearance.</li>
</ul>
<p><strong><br />
Users<br />
</strong></p>
<p>Equella has about 18 clients in Australia, including several Tasmanian, Queensland, South Australian and Victorian TAFEs and Education Departments. One institution chose Equella to handle 4 primary tasks:</p>
<ul>
<li>RQF reporting (now ERA)</li>
<li>As a backend to Moodle CMS</li>
<li>To be the base of the e-reserve collection</li>
<li>Developing a university research repository</li>
<li>Open Source solutions were not an option because of limitations of the brief.</li>
</ul>
<p>Another because:</p>
<ul>
<li>it has a well defined administration module, so a non-IT person can set it up with a graphic user interface with various collections and permissions, and they did not want to align with a particular library system</li>
</ul>
<h3><span style="color:#ff0000;">Fez</span></h3>
<p><a href="http://dev-repo.library.uq.edu.au/wiki/index.php/Main_Page">http://dev-repo.library.uq.edu.au/wiki/index.php/Main_Page</a></p>
<p>Fez is a front end to the Fedora repository software. It is developed by the University of Queensland Library as an open source project hosted on SourceForge. See <a href="http://sourceforge.net/projects/fez/">http://sourceforge.net/projects/fez/</a></p>
<p>Project Overview: <a href="http://espace.library.uq.edu.au/documentation/">http://espace.library.uq.edu.au/documentation/</a></p>
<p>Fez is part of the Australian Partnership for Sustainable Repositories (APSR). Fez is one of the deliverables of the APSR eScholarship testbed in the University of Queensland Library.</p>
<p><strong>Support</strong></p>
<ul>
<li>University of Queensland developer/s does offer support for other users of Fez. These should be contacted for details.</li>
<li>To implement Fez an IT officer with the following basic knowledge set would be required:
<ul>
<li>* MySQL<br />
* Fedora<br />
* Fez &#8211; written in PHP, but also CSS, html (Smarty html templating)<br />
* Apache<br />
* (related other pre-req software)<br />
* Understanding how it all fits together</li>
</ul>
</li>
<li>This would also involve that person (obviously) having programmer level access to the server on which Fez ran (something to be considered, depending on your IT department&#8217;s policies).</li>
<li>The University of Queensland developer</li>
</ul>
<p>RUBRIC recommendation at February 2007:</p>
<p style="padding-left:30px;">For institutions wanting to run a general purpose repository Fez is a promising choice, provided that the technical resources are available to manage it. Contact the developers, as they may be able to offer support under a formal arrangement.</p>
<p><strong>Functionality</strong></p>
<ul>
<li>Fez is a rapidly maturing repository software application.</li>
<li>Fez is built around constructs known as &#8220;Communities&#8221; and &#8220;Collections&#8221;.</li>
<li>Supports self-archiving</li>
<li>Workflow authentications and authorizations. These are configurable through GUI interface.</li>
<li>Security based on FezACML to describe user roles and rights on a per object basis or through parent collection or community security inheritance.</li>
<li>Security at object granularity.</li>
<li>Statistics (eg Downloads per Author, per Community, per Collection, per Subject etc).</li>
<li>Preservation metadata extraction</li>
<li>OAI service provider for harvesting</li>
<li>Supports migration to and from other repository systems (DSpace, EPrints, VITAL)</li>
</ul>
<h3><span style="color:#ff0000;">VITAL</span></h3>
<p><a href="http://www.vtls.com/products/vital">http://www.vtls.com/products/vital</a></p>
<blockquote><p>VITAL provides every feature–ingesting, storing, indexing, cataloging, searching and retrieving–required for handling large text and rich content collections. VITAL takes advantage of technology standards such as RDF, XML, TEI, EAD and Dublin Core to easily describe and index an assortment of electronic resources. VITAL leverages the benefits of open-source solutions such as Apache, MySQL, McKOI and FEDORA™. VITAL conforms to common Internet data communications standards such as TCP/IP, HTTP, SOAP and FTP. Additional standards utilized include WSDL Web Services, OAI-PMH, Dublin Core, MARCXML, JHOVE, MIX (Metadata for Images in XML Schema), and SRU.  <em>(from the VTLS site)</em></p></blockquote>
<p>A PREMIS datastream is also generated at ingest.</p>
<p><strong>Support</strong></p>
<p>Australia&#8217;s community of VITAL users has been able to coordinate support through <a href="http://www.arrow.edu.au/">ARROW</a>. 2008 is the final year of the ARROW project. ARROW is expected to be replaced by a CAUL sponsored body, CAIRRS, with a remit of supporting the Australian university repository community more generally. VTLS has acknowledged that their past record in prior testing of new product versions, and their follow up support, could have been better. At a recent ARROW meeting in Brisbane, a VITAL representative assured users that VITAL itself would upgrade users&#8217; current versions to 3.4, due for release around the end of October. In the past, patches have not always been available for recognized issues (including significant ones such as certain PDF files not able to be indexed) and users have had to wait for new version releases. VTLS does have an online &#8220;hotline&#8221; for logging such issues.</p>
<p><strong>Functionality</strong></p>
<p>The VITAL product promises much. It has the advantages of a Fedora base, which enables the storage of a wide variety of content types, and greater sophistication in their management and support for any standard metadata schema (hierarchical or flat).</p>
<ul>
<li>Documents can      be set to active or inactive (public or hidden) &#8212; although at the moment of deposit they necessarily default to active</li>
<li>Self submission      is possible through VALET</li>
<li>Workflow stages can be configured so that a central library service can monitor self-submitted documents for quality control and copyright issues</li>
<li>No embargo      functionality</li>
<li>Different types      of media files can be deposited (e.g. mp3, pdf, video)</li>
<li>OAI harvesting</li>
<li>Statistics &#8212; although some institutions opt to hide this function because the stats file consistently corrupts at regular intervals or the statistics re-set to zero, although this is said to be fixed in version 3.4</li>
<li>Copyright: this can be managed by manual assignment of access rights to the object.</li>
<li>Supports migration to and from other repository systems (DSpace, EPrints, VITAL) with METS.</li>
<li>Ability to search the full-text content of PDF, DOC, RTF and other document formats &#8212; although there are currently security issues with this function, such as public searches not being completely cut off from &#8220;hidden objects&#8221;</li>
<li>Ability to display multi-page documents.</li>
<li>Integrated editors for easy editing of metadata.</li>
<li>Customizable templates for display of content.</li>
</ul>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/metalogger.wordpress.com/279/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/metalogger.wordpress.com/279/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/metalogger.wordpress.com/279/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/metalogger.wordpress.com/279/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/metalogger.wordpress.com/279/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/metalogger.wordpress.com/279/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/metalogger.wordpress.com/279/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/metalogger.wordpress.com/279/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/metalogger.wordpress.com/279/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/metalogger.wordpress.com/279/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=metalogger.wordpress.com&blog=559463&post=279&subd=metalogger&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://metalogger.wordpress.com/2008/10/19/comparing-some-institutional-repository-solutions/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/9eb80d11dd33a3b625e2b0a1204fdb45?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">neilgodfrey</media:title>
		</media:content>
	</item>
		<item>
		<title>The Politics of Research: another reason Research codes (RFCD / FOR. . .) are NOT a Subject substitute</title>
		<link>http://metalogger.wordpress.com/2008/10/17/the-politics-of-research-another-reason-research-codes-rfcdfor-are-not-a-subject-substitute/</link>
		<comments>http://metalogger.wordpress.com/2008/10/17/the-politics-of-research-another-reason-research-codes-rfcdfor-are-not-a-subject-substitute/#comments</comments>
		<pubDate>Fri, 17 Oct 2008 03:04:50 +0000</pubDate>
		<dc:creator>neilgodfrey</dc:creator>
				<category><![CDATA[Metadata]]></category>
		<category><![CDATA[Repositories]]></category>

		<guid isPermaLink="false">http://metalogger.wordpress.com/?p=266</guid>
		<description><![CDATA[There are many reasons many university institutional research repositories use reporting codes (in Australia RFCD and FOR) as a convenient substitute for a subject search index. &#8220;Out of the box&#8221; technical configurations of repository software, costs of adding new configurations and adjusting existing portals to accommodate them, status quo situations with harvesters and costs of [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=metalogger.wordpress.com&blog=559463&post=266&subd=metalogger&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>There are many reasons many university institutional research repositories use reporting codes (in Australia RFCD and FOR) as a convenient substitute for a subject search index. &#8220;Out of the box&#8221; technical configurations of repository software, costs of adding new configurations and adjusting existing portals to accommodate them, status quo situations with harvesters and costs of entering and maintaining controlled vocabularies are the most obvious ones.</p>
<p>Today I had my first collective meeting with library, legal, R&amp;D and academic reps at our university as a first step towards garnering broader institutional support for a research repository. While we were discussing some details of the data required for deposit into the repository one of the academics gave me a thorough lesson in what those research codes meant in his circle.</p>
<p>A project on forest ecology could fit under research codes for forestry or ecology. In deciding which one to use for a grant application, it will be natural to consider which one is the more likely to lead to the better financial support. Foresters would obviously be more sympathetic to a forest ecology project that offered improved timber yields, and ecologists more favourable to one reducing logging quotas.</p>
<p>Technically librarians or repository editors can, of course, simply add more codes to cover all bases, but by doing that they will be compromising any preservation and authentication functions the repository might have. The fact that a research paper was reported and funded as a &#8220;forestry&#8221; project, and not as an &#8220;ecology&#8221; one, may well be considered a vital part of the record.</p>
<p>While it has been easy to think of research codes as a &#8220;good enough&#8221; substitute for subject indexes in repositories, this particular academic mentor left no room for ambiguity:</p>
<blockquote><p><strong><em>I can&#8217;t imagine <span style="text-decoration:underline;">anyone</span> using research codes to search for subjects!</em></strong></p></blockquote>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/metalogger.wordpress.com/266/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/metalogger.wordpress.com/266/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/metalogger.wordpress.com/266/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/metalogger.wordpress.com/266/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/metalogger.wordpress.com/266/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/metalogger.wordpress.com/266/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/metalogger.wordpress.com/266/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/metalogger.wordpress.com/266/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/metalogger.wordpress.com/266/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/metalogger.wordpress.com/266/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=metalogger.wordpress.com&blog=559463&post=266&subd=metalogger&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://metalogger.wordpress.com/2008/10/17/the-politics-of-research-another-reason-research-codes-rfcdfor-are-not-a-subject-substitute/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/9eb80d11dd33a3b625e2b0a1204fdb45?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">neilgodfrey</media:title>
		</media:content>
	</item>
		<item>
		<title>Are repositories set to be left out in the cold?</title>
		<link>http://metalogger.wordpress.com/2008/10/10/are-repositories-set-to-be-left-out-in-the-cold/</link>
		<comments>http://metalogger.wordpress.com/2008/10/10/are-repositories-set-to-be-left-out-in-the-cold/#comments</comments>
		<pubDate>Fri, 10 Oct 2008 03:01:50 +0000</pubDate>
		<dc:creator>neilgodfrey</dc:creator>
				<category><![CDATA[Dublin Core]]></category>
		<category><![CDATA[Harvesting]]></category>
		<category><![CDATA[Repositories]]></category>

		<guid isPermaLink="false">http://metalogger.wordpress.com/?p=247</guid>
		<description><![CDATA[Repositories and their harvesters have a rule of their own that violates Dublin Core standards. Because of this, are repositories and harvesters on target for a massive retroversion or major set of patches if they are to be a part of the semantic web? (I don&#8217;t know, but I&#8217;d like to be sure about the [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=metalogger.wordpress.com&blog=559463&post=247&subd=metalogger&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p>Repositories and their harvesters have a rule of their own that violates Dublin Core standards. Because of this, are repositories and harvesters on target for a massive retroversion or major set of patches if they are to be a part of the semantic web? (I don&#8217;t know, but I&#8217;d like to be sure about the answer.)</p>
<p>Once again at a Dublin Core conference I listened to some excellent presentations on the functionality and potential applications of Dublin Core, but this time I had to see if I could poop the party and ask at least one speaker why the nice theory and applications everywhere simply did not work with the OAI harvesting of repositories.</p>
<p>I like to think that standards have good rationales. The web, present and future (e.g. the semantic web) is  predicated upon internationally recognized standards like Dublin Core. According to the <a href="http://dublincore.org/documents/dces/">DCMI site</a> the fifteen element descriptions of Simple Dublin Core have been formally endorsed by:</p>
<ul>
<li>ISO Standard 15836-2003 of February 2003 [<a href="http://dublincore.org/documents/dces/#ISO15836">ISO15836</a>]</li>
<li>ANSI/NISO Standard Z39.85-2007 of May 2007 [<a href="http://dublincore.org/documents/dces/#NISOZ3985">NISOZ3985</a>]</li>
<li>IETF RFC 5013 of August 2007 [<a href="http://dublincore.org/documents/dces/#RFC5013">RFC5013</a>]</li>
</ul>
<p>But there is one area where there is a clear conflict between DCMI element definitions and OAI-PMH protocols. The <a href="http://dublincore.org/documents/usageguide/elements.shtml">DC usage guide</a> explains the identifier element:</p>
<p style="padding-left:30px;"><strong>4.14. Identifier</strong></p>
<p style="padding-left:30px;"><em>Label: Resource Identifier</em></p>
<p style="padding-left:30px;"><em>Element Description:</em> An unambiguous reference to the resource within a given context. Recommended best practice is to identify the resource by means of a string or number conforming to a formal identification system. Examples of formal identification systems include the Uniform Resource Identifier (URI) (including the Uniform Resource Locator (URL), the Digital Object Identifier (DOI) and the International Standard Book Number (ISBN).</p>
<p style="padding-left:30px;"><em>Guidelines for content creation:</em></p>
<p style="padding-left:30px;">This element can also be used for local identifiers (e.g. ID numbers or call numbers) assigned by the Creator of the resource to apply to a particular item.<strong><span style="color:#ff0000;"> It should not be used for identification of the metadata record itself.</span></strong></p>
<p>Contrast the <a href="http://www.openarchives.org/OAI/openarchivesprotocol.html#Repository">OAI-PMH protocol</a>:</p>
<p style="padding-left:30px;"><strong>A <em>unique identifier</em> unambigiously identifies an item within  a repository; the unique identifier is used in OAI-PMH requests for extracting  metadata from the item.</strong> Items <strong>may</strong> contain metadata in  multiple formats. The unique identifier  maps to the item, and all possible records available  from a single item share the same unique identifier.</p>
<p>The same protocol explains that an item is clearly distinct from the resource and points to metadata about the resource:</p>
<ul>
<li><strong><em>resource</em></strong> &#8211; A resource is the object or &#8220;stuff&#8221; that metadata is &#8220;about&#8221;. The nature of a resource, whether it is physical or digital, or whether it is stored in the repository or is a constituent of another database, is outside the scope of the OAI-PMH.</li>
<li><strong><em>item</em></strong> &#8211; An item is a constituent of a repository from which metadata about a resource can be disseminated. That metadata may be disseminated on-the-fly      from the associated resource, cross-walked from some canonical form, actually stored in the repository, etc.</li>
<li><strong><em>record</em> </strong>- A record is metadata in a specific metadata format. A record is returned as an XML-encoded byte stream in response to a protocol request to disseminate a specific metadata format from a constituent item.</li>
</ul>
<p>I wrote about this clash of standards and protocols in <a href="http://metalogger.wordpress.com/2007/09/02/repository-display-configurations-can-clash-with-oai-harvesters/">another post last year</a>. One response was to direct readers to <a href="http://webservices.itcs.umich.edu/mediawiki/oaibp/?TableOfContents">Best Practices for OAI Data Provider Implementations and Shareable Metadata</a>.</p>
<p>The working result for many repositories is a crazy inconsistency. Within a single Dublin Core record for OAI harvesting the same element name, identifier, can actually be used to identify different things:</p>
<pre style="padding-left:30px;">&lt;oai_dc:dc
     xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
     xmlns:dc="http://purl.org/dc/elements/1.1/"
     xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
     xsi:schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/
     http://www.openarchives.org/OAI/2.0/oai_dc.xsd"&gt;
   &lt;dc:title&gt;Using Structural Metadata . . . &lt;/dc:title&gt;
   &lt;dc:creator&gt;Dushay, Naomi&lt;/dc:creator&gt;
   &lt;dc:subject&gt;Digital Libraries&lt;/dc:subject&gt;
   &lt;dc:description&gt;[Abstract here]&lt;/dc:description&gt;
   &lt;dc:description&gt;23 pages including 2 appendices&lt;/dc:description&gt;
   &lt;dc:date&gt;2001-12-14&lt;/dc:date&gt;
   &lt;dc:type&gt;e-print&lt;/dc:type&gt;
   <span style="color:#ff0000;">&lt;dc:identifier&gt;http://eprints.repository.edu/318/&lt;/dc:identifier&gt;
   &lt;dc:identifier&gt;<tt>1-85636-082-X</tt>&lt;/dc:identifier&gt;</span>
 &lt;/oai_dc:dc&gt;</pre>
<p>In this OAI DC the first identifier identifies the splash page for the resource in the repository. The second identifier identifies the resource itself. It works for now, between agreeable partners. But how sustainable is such a contradiction? What is the point of standards?</p>
<p>As far as I understand the issue, this breakdown in the application of the Dublin Core standard is the result of institutional repositories needing their own branding to come between users and the resources they are seeking. Without that branding they would scarcely have the institutional support that enables them to exist in the first place.</p>
<p>Surely there must be other ways for harvesters to be aware of the source of any particular resource harvested and hence there must be other ways they can meet the branding requirement. Surely there is a way to retrieve an identified resource (not an identified metadata page about the resource) and to display it with some branding banner that will alert users to the repository &#8212; and related files and resources &#8212; where it is archived. Yes?</p>
<p>I mention &#8220;related files and resources&#8221; along with the branding page &#8212; but maybe this is a separate issue. Where a single resource consists of multiple files then is the metadata page a valid proxy for that resouce anyway? Or is there another way of displaying these?</p>
<p>Australia has had the advantage of a national metadata advisory body, <a href="http://www.arrow.edu.au/macar">MACAR</a>. The future of MACAR into next year is still under discussion, but such an issue would surely be an ideal focus for such a body &#8212; to examine how this clash impacts the potentials of repositories today and in the future. A national body like MACAR has a lot more leverage for pioneering changes if and where necessary.</p>
<p>What should be done?</p>
<p>What can be done?</p>
<hr /><em>But is there more? more confusion of terms?<br />
</em></p>
<p><em>In having another look at the DCMI site for this post I noticed something else in the latest <a href="http://dublincore.org/documents/dces/">DC Element Set</a></em> description page:</p>
<p style="padding-left:30px;"><em><strong>Term Name: identifier</strong><br />
URI: http://purl.org/dc/elements/1.1/identifier<br />
Label: Identifier<br />
Definition: An unambiguous reference to the resource within a given context.<br />
Comment: Recommended best practice is to identify the resource by means of a string conforming to a formal identification system.</em></p>
<p><em>DCMI recommends that an identifier be &#8220;a string&#8221;. In the context of RDF and the semantic web my understanding of &#8220;string&#8221; is a dead-end set of letters as opposed to a resolvable uri or &#8220;thing&#8221;. But the <a href="http://dublincore.org/documents/usageguide/elements.shtml#identifier">DC Usage Guide</a> &#8220;explains&#8221; that an applicable formal identification system allowed here can also be a URI.  So what don&#8217;t I understand about the difference between strings and (RDF) things, now? </em></p>
<hr />
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/metalogger.wordpress.com/247/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/metalogger.wordpress.com/247/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/metalogger.wordpress.com/247/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/metalogger.wordpress.com/247/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/metalogger.wordpress.com/247/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/metalogger.wordpress.com/247/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/metalogger.wordpress.com/247/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/metalogger.wordpress.com/247/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/metalogger.wordpress.com/247/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/metalogger.wordpress.com/247/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=metalogger.wordpress.com&blog=559463&post=247&subd=metalogger&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://metalogger.wordpress.com/2008/10/10/are-repositories-set-to-be-left-out-in-the-cold/feed/</wfw:commentRss>
		<slash:comments>12</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/9eb80d11dd33a3b625e2b0a1204fdb45?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">neilgodfrey</media:title>
		</media:content>
	</item>
		<item>
		<title>Online journals and institutional repositories &#8211; comparing their potential impacts on research methods (and journal publications)</title>
		<link>http://metalogger.wordpress.com/2008/10/02/a-comparison-of-impacts-of-online-journal-databases-and-institutional-repositories/</link>
		<comments>http://metalogger.wordpress.com/2008/10/02/a-comparison-of-impacts-of-online-journal-databases-and-institutional-repositories/#comments</comments>
		<pubDate>Thu, 02 Oct 2008 12:55:53 +0000</pubDate>
		<dc:creator>neilgodfrey</dc:creator>
				<category><![CDATA[Repositories]]></category>

		<guid isPermaLink="false">http://metalogger.wordpress.com/?p=227</guid>
		<description><![CDATA[ 
While discussing the potential impact of institutional repositories on journal publications, an academic alerted me to an article &#8212; Electronic Publication and the Narrowing of Science and Scholarship by James A. Evans &#8212; discussing research into the impact of the online journal culture on research methods in the world of scholarship.
The article is interesting [...]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=metalogger.wordpress.com&blog=559463&post=227&subd=metalogger&ref=&feed=1" />]]></description>
			<content:encoded><![CDATA[<div class='snap_preview'><br /><p><!--[if gte mso 9]&gt; Normal   0               false   false   false      EN-AU   X-NONE   X-NONE                                                     MicrosoftInternetExplorer4 &lt;![endif]--><!--[if gte mso 9]&gt; &lt;![endif]--><!--  --><!--[if gte mso 10]&gt; &lt;!   /* Style Definitions */  table.MsoNormalTable 	{mso-style-name:"Table Normal"; 	mso-tstyle-rowband-size:0; 	mso-tstyle-colband-size:0; 	mso-style-noshow:yes; 	mso-style-priority:99; 	mso-style-qformat:yes; 	mso-style-parent:""; 	mso-padding-alt:0cm 5.4pt 0cm 5.4pt; 	mso-para-margin:0cm; 	mso-para-margin-bottom:.0001pt; 	mso-pagination:widow-orphan; 	font-size:10.0pt; 	font-family:"Calibri","sans-serif";} --> <!--[endif]--></p>
<p>While discussing the potential impact of institutional repositories on journal publications, an academic alerted me to an article &#8212; <em><strong><a href="http://www.sciencemag.org/cgi/content/abstract/321/5887/395">Electronic Publication and the Narrowing of Science and Scholarship</a> by James A. Evans</strong></em> &#8212; discussing research into the impact of the online journal culture on research methods in the world of scholarship.</p>
<p>The article is interesting for several reasons, but I want to address its conclusions in relation to research repositories &#8211; something that the research did not intend to address. I think such a comparison is worthwhile to the extent that it helps clarify what I see as how institutional repositories and online journal databases each differently potentially impacts both research methods and publishing companies&#8217; futures. I also can&#8217;t resist a comment on my thoughts of where we are headed subsequent to the current world of online databases, whether of journals or research repositories.</p>
<p>The opening abstract of the article notes:</p>
<p style="padding-left:30px;"><em>Online journals . . .  are used differently than print &#8212; scientists and scholars tend to search electronically and follow hyperlinks rather than browse or peruse . . . .</em></p>
<p><strong>IR comparison comment:</strong> IRs, on the other hand, do facilitate <strong>browsing</strong> by keywords, titles, authors, year, resource type (that is, text or still image or video . . . )  &#8211; not just searching as per online journals like JSTOR or EBSCO.</p>
<p>Evans&#8217; research into the impact of online journal databases found that:</p>
<p style="padding-left:30px;"><em>As deeper backfiles became available, more recent articles were referenced; as more articles became available, fewer were cited and citations become more concentrated within fewer articles.</em></p>
<p><strong>Costs and benefits of online journal databases</strong></p>
<p>His interpretation of this paradox:</p>
<ol>
<li>as online searching replaces browsing in print, there is greater avoidance of older and less relevant literature;</li>
<li>hyperlinking through an online archive puts experts in touch with consensus about the most important prior work &#8211; what work is broadly discussed and referenced;</li>
<li>thus online search bypasses many marginally related articles that are still skimmed by print researchers.</li>
</ol>
<p style="padding-left:30px;"><em>Findings and ideas that do not become consensus quickly will be forgotten quickly.</em></p>
<p style="padding-left:30px;"><em> </em></p>
<p style="padding-left:30px;"><em>This research ironically intimates that one of the chief values of print library research is poor indexing. Poor indexing &#8211; indexing by titles and authors, primarily within core journals &#8211; likely had unintended consequences that assisted the integration of science and scholarship. By drawing researchers through unrelated articles, print browsing and perusal may have facilitated broader comparisons and led researchers into the past.</em></p>
<p>Evans sees this as one more step away from the contextualized monograph:</p>
<p style="padding-left:30px;"><em>the contextualized monograph, like Newton&#8217;s Principia or Darwin&#8217;s Origin of the Species, to the modern research article. The Principia and Origin, each produced over the course of more than a decade, not only were engaged in current debates, but wove their propositions into conversations with astronomers, geometers, and naturalists from centuries past.</em></p>
<p>Thus the higher efficiency with which arguments can be framed with the assistance of online searching and hyperlinking, &#8220;the more focused &#8211; and more narrow &#8211; past and present&#8221; they will be.</p>
<p>It is not a strictly fair comparison, but this does remind me of a time I was inspired with fresh insights into a topic I was investigating simply from the serendipitous luck of accidentally noticing a title on a tangential topic placed just one library shelf above the one which held the exact classification numbers I was directed to browse. No online search &#8211; nor any print citation index &#8211; could ever enable me to repeat that particular stroke of luck.</p>
<p>In other words, any technological change will charge some costs against the way we used to do things. Maybe the advent of the printing press led some researchers to miss the chance discoveries they made in the days they had to rely on personal travel to where certain hand-copied books were known to be stored. But each change also brings its own new avenues for broader comparisons and insights from unintended serendipity. If this is not currently happening on a large enough scale to impact the statistical research of Evans&#8217; article, then it is reassuring to know that this is not the end of the story.</p>
<p>But no doubt Evans also would acknowledge that the broader conversations involved in works like <em>Principia</em> and <em>Origin</em> were themselves scarcely the outcome of unintended consequences of the relatively poor indexing of the print media. Today&#8217;s researchers and scholars, I suspect, are under far more institutional pressures to specialize, produce and publish at a certain rate than regularly experienced by Newton and Darwin.</p>
<p>Innovation also follows demands and needs, including those of researchers. Online journal and e-book databases and catalogs are not the only points to which the electronic media are leading.</p>
<p><strong>Comparing IRs with online journals, their functions and potential impacts</strong></p>
<p>IRs are finding their way into a growing number of universities. (See my <a href="../../../../../2008/08/04/australian-university-repositories-research-and-publications/">list of current Australian IRs and links to other registries</a>.) These IRs do support broader opportunities for browsing, not just searching, by both uncontrolled keywords and sometimes controlled vocabularies, resource types, authors and titles. Browsing is one of the alleyways the Evans article laments is missing with electronic searches. That is not the case with most IRs. Nor are IRs restricted to the localized browsing of a single institution&#8217;s research repository. They can be browsed collectively through harvesters such as OAIster, the Australia&#8217;s Discovery Service, etc.</p>
<p>IRs are something other than a high-tech attempt at a more efficient journal database. They are quite different. They are a means of individual academics &#8211; and their institutions &#8212; making visible and accessible their collective works. They are a means of both showcasing and preserving personal and institutional research, and also of making publicly funded research instantly and openly accessible to all.</p>
<p>IRs offer the capacity to more easily interrogate the discussions of individual authors over time. This is surely a potentially useful alternative to journal databases that are structured around topics. Researchers do publish across many journals and the history of a single researcher&#8217;s output can be immediately apparent in an IR even though they have been published across a wide range of journals.</p>
<p>Not only immediately apparent, but immediately accessible in those repositories generated by the open access principle that holds that publicly funded research should be publicly available. Most online journal databases restict access to those who belong to an institution with the appropriate subscriptions. If a journal is neither online nor held in print by an institution then one must wait days for the interlibrary loan process before accessing the article. In an open access repository it is instantly available. No waiting time or tedious paperwork/online-form processes to negotiate.</p>
<p>Journal titles that originally hosted these publications are also referenced, thus in some cases also raising awareness of journals that might otherwise have been less widely known.</p>
<p><strong>Where it&#8217;s all headed?</strong></p>
<p>That&#8217;s the present. And the web is still in its infancy. Our wheels are still revving in the first generation of the Web, Web 1.0. Online journal databases, and institutional repositories too, are nothing more than a mass of web pages or documents waiting to be accessed. They are little more than a &#8220;more efficient&#8221; form of the print media and print indexing. In the case of IRs, they have the bonus of allowing more innovative and extensive browsing, too. Web 2.0 is a cute next step allowing social networking, which a growing number of scholars are finding really is more than simply cute. But the next evolutionary step, Web 3.0, is beginning to mutate.</p>
<p>This will be <a href="http://en.wikipedia.org/wiki/Semantic_web">the semantic web</a>, where information will be meaningfully contextualized in the way early (19<sup>th</sup> century!) information managers and innovators (thinking of <a href="http://en.wikipedia.org/wiki/Charles_Ammi_Cutter">Charles Cutter</a>), and who knew only the print medium, originally intended. The semantic web will mean an online world where all the varying information topics (not just web pages or pdf files) have their uri namespaces and where it will be possible for users to search through them via meaningful relationship enquiries (not just &#8220;X links to Y&#8221; but &#8220;how X links to Y&#8221;; and both X and Y can be interrogated within their ontological relationship to each other &#8211; is one a subset of the other? or is one an echo of another in a different discipline?) Not that Cutter envisaged the semantic web, of course. But he did seek a way of organizing information in a way that was more meaningful and useful than the classification systems that we ended up with in libraries.</p>
<p>Part of this will be the exchange and reuse of objects within datasets and research databases (<a href="http://www.openarchives.org/ore/">ORE</a>). Datasets from different fields and disciplines can scarcely &#8220;talk&#8221; to each other today because of their different measuring and conceptual modelling. Any overlapping concepts are known to few outside those familiar with both disciplines, and even those few may rarely be able to make use of such overlaps because of their varying languages. The next stage of web development will see a working towards the ability to interrogate meaningfully, and select and re-use in other contexts the specific information we seek, as well as the ability to explore major and minor side-avenues. We will not be restricted to searching or browsing pages that have been prepared for searching and browsing (and &#8220;mere&#8221; hyperlinking) by others.</p>
<p>If we are moving away from the &#8220;humanist&#8221; benefits of inefficient print indexing, we are, I believe, moving towards an even greater scope for creatively exploring the total chaos of information.</p>
  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/metalogger.wordpress.com/227/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/metalogger.wordpress.com/227/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godelicious/metalogger.wordpress.com/227/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/delicious/metalogger.wordpress.com/227/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gostumble/metalogger.wordpress.com/227/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/stumble/metalogger.wordpress.com/227/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/godigg/metalogger.wordpress.com/227/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/digg/metalogger.wordpress.com/227/" /></a> <a rel="nofollow" href="http://feeds.wordpress.com/1.0/goreddit/metalogger.wordpress.com/227/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/reddit/metalogger.wordpress.com/227/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=metalogger.wordpress.com&blog=559463&post=227&subd=metalogger&ref=&feed=1" /></div>]]></content:encoded>
			<wfw:commentRss>http://metalogger.wordpress.com/2008/10/02/a-comparison-of-impacts-of-online-journal-databases-and-institutional-repositories/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://1.gravatar.com/avatar/9eb80d11dd33a3b625e2b0a1204fdb45?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">neilgodfrey</media:title>
		</media:content>
	</item>
	</channel>
</rss>