Wednesday, October 3, 2012

Using dbpedia to jumpstart EAC-CPF record creation

Dbpedia offers a wealth of open information in the form of RDF that can be used for all sorts of purposes.  It contains links to resources about the Wikipedia topic available online, the birth and death dates of individuals, subjects, occupations, and a variety of relations.  Additionally, abstracts and names are available in a plethora of languages.  These data can be used to generate fairly sophisticated EAC-CPF stubs, and over the last few weeks I have implemented two approaches to generating EAC from dbpedia RDF.

  1. In xEAC, through the XForms interface.
  2. With a PHP script which is offered open source in the xEAC github repository.

Generating EAC-CPF stubs with XForms in xEAC


I'll first address #1 above.  Suppose, for example, you are creating a new record in xEAC for Alexander the Great, whose resource is represented by http://dbpedia.org/resource/Alexander_the_Great.  By clicking the "Import DBpedia Data" trigger at the top of the page, a window will launch for the user to enter the resource URI.  After checking to see whether that is a viable resource, the user will have a list of options to check for importation: names, abstract, exist dates, CPF relations, resource relations, and thumbnail.

Friday, July 27, 2012

xEAC: XForms for EAC-CPF beta released

Today, two packages for a xEAC beta have been made available on github.  The following email is what I sent out to the Encoded Archival Description (EAD) list:

In early June I participated in the Linked Ancient World Data Institute at NYU, and there is a growing interest in ancient prosopography among classicists.  Having straddled both the museum and library/archival worlds, it makes sense to me to use EAC-CPF for this endeavor.  Several participants are now working together to build a network of Roman emperors and their relations.  We're in the data gathering stage at this point, but I have been working for the last six weeks an an XForms-based information management system for EAC-CPF records called xEAC (pronounced "zeke").  I think that it is ready for a beta release and have prepared two downloadable packages that can be dropped into Apache Tomcat.


Features


Generic installation instructions are found at http://wiki.numismatics.org/xeac:generic_installation

xEAC will continue to evolve over the coming months.  From a numismatic standpoint, I plan to link emperors together with their families, link emperors to provincial governors, link these people to mints (corporate entities), and link mints to anonymous die carvers which are identifiable by their artistic style.  Each of these entities can be linked to affiliated coins to start, but we can eventually associate sculpture, epigraphy, and literary sources to them.

Friday, December 2, 2011

EADitor December 2011 (.1112) beta + Improved Documentation

There are two major updates to report:

1.  The latest beta packages have been released to the Google Code download section.

Features (apart from creating, editing, publishing, and deleting EAD finding aids):

  • Public interface with faceted search results and facet-based OpenLayers mapping 
  • Linked data and geographic services: OAI-PMH feed, Solr-based Atom feed (embedded with geographic points) and search results in the form of KML
  • Geonames, LCSH, VIAF APIs for geographic, subject term, personal, and corporate name controlled vocabulary
  • Upload finding aids from the "wild" (if they adhere to EAD 2002).
  • Interface for reordering and setting permissions of components
  • Flickr API integration, attach flickr images as a daogrp
  • Simple template controls for EAD finding aids
  • Introduction of simple themes: select facet orientation on search page and from a selection of jQuery UI themes (theme controls will be enhanced over time)

2.  There is a new documentation wiki: http://wiki.numismatics.org/eaditor:eaditor.  Documenting EADitor will be a gradual process; I plan to devote several hours per week to the task.  The wiki will include instructions for installation, use, and eventually, customization (including Fedora and DSpace integration).

Installation Instructions
Generic installation instructions can be found here. Ubuntu-specific instructions can also be found on the wiki.

Friday, November 4, 2011

American Numismatic Society Unveils ARCHER: A Breakthrough in Archival Cataloging

New York, NY - American Numismatic Society today unveiled the new interactive ANS archival search tool, ARCHER (numismatics.org/archives/). ARCHER (ARCHival Electronic Resource) provides access to the ANS’ unrivalled collection of unique archival material, through a series of simple search screens.

ARCHER is powered by EADitor, open source software developed by the ANS for creating, managing, and publishing collections of archival finding aids described in Encoded Archival Description (EAD) XML. Like the society’s Collection database, MANTIS, ARCHER's public interface includes faceted searching and mapping of the collection. The administrative interface takes advantage of XForms, next-generation web forms, which enables the Society’s archivist, David Hill, to create the finding aids with an intuitive user interface that requires no technical knowledge of XML, a major advancement over many institutions' current workflows.

“The Society’s archives contains a rich diversity of material”, notes Deputy Director Andrew Meadows. “On the one hand there are the Society’s own archival records, spanning more than 150 years of our activity since 1858. As such, the archive is one of the oldest held by any learned society in the country. But in addition to that, the ANS also holds a remarkable archive of international significance, consisting of the papers of famous scholars, collectors and dealers in the field of numismatics. Documents, letters, notebooks and photographs relating to many famous individuals from within the numismatic community and beyond are held at the ANS. ARCHER provides a remarkably straightforward tool both for their cataloging, and for their search by the general public.”

The search interface is built on similar technology to the Society’s MANTIS database. Simple faceted searching based on personal names, places, dates, genre and Library of Congress subject headings are all provided. ARCHER also provides a map interface, which will allow users to visualize the places on the globe with relevant ANS archival material.

ANS Archivist David Hill explains, “What really sets ARCHER apart from other archives management systems is that it combines a simple method for the creation of full EAD with a powerful, built-in publishing feature that produces remarkably sophisticated finding aids. The faceted search capabilities give us an enhanced level of control over our archival holdings, ensuring that ANS staff and researchers can always find relevant materials from across our various collections.”

The design of ARCHER is the work of ANS Web Developer Ethan Gruber, working in close collaboration with David Hill. The database supports a variety of export formats that will encourage exploration of the links between numismatics and other disciplines. As with MANTIS, underlying this work is a technical approach called ‘Linked Open Data’. For the future this will mean increased opportunities both for the ANS to integrate searches across the whole range of its collections, such as books, coins, archives, and also for the rest of the world wide web to discover and link to our material. As Meadows notes, “The new suite of ANS search tools, comprising DONUM (the library catalog), MANTIS and ARCHER, will bring numismatic material to whole new audiences. In this the ANS is leading the world.”

For more information, contact ANS Deputy Director Andrew Meadows (212) 571-4470 ext. 111, meadows@numismatics.org.

The American Numismatic Society, organized in 1858 and incorporated in 1865 in New York State, operates as a research museum under Section 501(c)(3) of the Internal Revenue Code and is recognized as a publicly supported organization under section 170(b)(1)(A)(vi) as confirmed on November 1, 1970.

Bringing the ANS archives to a larger audience

Just a quick update on recent happenings...

Over the last several days, I have been working on creating an OAI-PMH service layer in Archer, the ANS's implementation of EADitor.  It is complete and passed all of the OAI validation tests a few minutes ago.  Openarchives.org has successfully registered it.  Like the Atom feed, the OAI service is generated dynamically from Solr search results.  Archer's OAI service will be added to the EADitor general distribution trunk soon, so you'll see it in the next beta release (coming by the end of the year)!

Thursday, October 20, 2011

Mapping Archival Collections via Atom

As mentioned in a previous blog post, EADitor enables georeferencing of archival collections, either as a whole or perhaps on the component level, by tapping into APIs provided by geonames.org.  A mapping interface solicits user input of facets to query EADitor's built-in Solr-to-KML service to render results in the form of an OpenLayers map.

Currently, only one collection in the American Numismatic Society Archives has been georeferenced, but we expect to add more geographic locations in the coming weeks and months as we continue to refine the collections' descriptions.  The more places that have been added into the Solr index, the more useful the Ajax-driven facet querying interface will be.

On top of this, GML points have been added into the Atom feed for the collection.  Like KML, the Atom feed is driven by Solr and can be queried with the Lucene syntax.  Google Maps is capable of rendering an Atom feed which contains GML points, which you can see here.

The ANS code is about a week ahead of what exists in the Google Code trunk for the EADitor general distribution, but I will work on integrating these new mapping features into the trunk before the beta release that should come in early November.

Thursday, September 29, 2011

A Brief Description of Recent Updates

The American Numismatic Society's finding aid collection (an ANS implementation of EADitor) is tentatively set to be released in late October, and I am planning a major release of the EADitor general distribution beta to coincide with this event.  Over the last few weeks, I have been making some improvements to the code, several of which I will discuss briefly in this post.  A number of these improvements have been borrowed from Numishare, which is our software for managing and publishing collections of coins and similar objects.  Like EADitor, Numishare's back-end is built on top of XForms and the server-side XForms processor, Orbeon, so much of the code is interchangeable.

The list:
  1. A major effort to remove hard-coded URLs to eXist and Solr, placing them, instead, into configuration files that can be edited by an XForms web form.
  2. Introduction of themes configured through a web form.  Though rudimentary so far, they are documented here.  Currently, the user can select from a list of available jQuery UI themes and select the layout of the search results page: facet list in a left or right-aligned column.
  3. Flickr API integration.
  4. The introduction of a new Solr core for storing docs of every finding aid in eXist (including those not published), so that the file list page in the admin section loads faster and enables full-text searching of the collection.  I will eventually develop methods for more advanced searching and sorting of records in the admin page.  Each time the file list page is loaded, it verifies the number of eXist documents is the same as Solr documents, so that if finding aids are added/removed with the eXist client, the Solr index is flushed and the collection reindexed.
  5. Batch publication of all finding aids to the public Solr index and batch removal of all documents from this index.
  6. The ability to edit a container-type template.
  7. Significant improvements in performance due to rewriting of some XBL components in which a glitch in my code resulting in commands being fired off numerous times, and other general bug fixes.
There aren't many more improvements or features that I would like to make to the code before the next major release in several weeks, but I would like to begin writing documentation for use and testing with Fedora repositories.






Screenshot showing "sunny" jQuery UI theme and a thumbnail served from flickr.  The banner text "EADitor: XForms for EAD" is stored in the configuration file and editable in the admin section.