I said a few entries ago that I was working on an entry on the catalog. I need to hurry up! Some big catalogish things came along this week.
First the University of California released a significant report on its bibliographic infrastructure, on how catalogs should be built, presented and managed.
Rethinking how we provide bibliographic services for the University of California
And then North Carolina State University released its new catalog to some acclaim. This is special in that they have built it themselves outside the integrated library system using Endeca software. Interestingly, the NCSU catalog incorporates many of the things discussed in the UC report, and more of them are on its development path. That suggests that there may be some convergence in thinking about features, although many other questions remain open.
Here are some of the things I took away from the UC report (my words):
- Services. A desire to provide direct access to described items. Bibliographic systems should not run into dead-ends and disappoint the user. A recommender service is desirable. Results should be ranked in meaningful ways.
- Bibliographic structure. A variety of metadata schema should be supported and used. The ISO 2709/MARC/AACR2 stack should not be seen as the default; schema should be appropriate to materials under consideration. There is strong support for FRBR and faceted browse to structure large result sets and provide sensible navigation options. Browse needs to be supported by controlled data for name, place, time period, and uniform title. Interestingly, the report questions whether controlled subject data will be necessary in light of table of contents and other associated data. (Subject browse is a strong feature of the NCSU catalog.)
- At the point of need. There is a recognition that services need to be where the user is, so bibliographic services and data need to be surfaced in course management systems, institutional portals, and search engines. (The library in the user environment.)
- Discovery: consolidation and gravitational pull. There is some discussion of unifying the UC ‘bibliographic universe’, recognizing that a larger integrated data resource exerts a stronger gravitational pull than multiple resources, especially where users do not appreciate the differences between resources.
- Technical processing. There is also discussion of consolidating cataloging and other processing activity, and reducing the cost of bibliographic management. It is suggested that more data needs to be captured upstream from vendors, and that there be more selective programmatic upgrade of data downstream. Which leads to less time spent on manual cataloging.
- Platform and organization. One of the recurrent themes is how to source particular capacities. For example, what platform would a unified catalog be built on: a local integrated library system product or an external party like RLG or OCLC.
- Value. There is an arresting statement on page 9 of the report: “we all agree that the cost of our bibliographic services enterprise is unsupportable as we move into an increasingly digital world, and a solution is nowhere in sight”. Throughout the report, there is an awareness that inefficiencies represent opportunity costs and that these are increasingly intolerable in a changing world with many additional demands.
The NCSU catalog is big news. Why it even makes it as a news item on the home page of NCSU itself!
NC State Vice Provost and Director of Libraries Susan K. Nutter says, “With this groundbreaking approach, the NCSU Libraries is responding to Web searchers who expect to retrieve results in order of relevance. The new system – the first of its kind in a library – empowers users to quickly locate the items they’re looking for or to explore the multifaceted research collection in depth, exploiting both the software’s cutting-edge capabilities and the library’s many decades of investment in detailed cataloging and classification.” [News Release:]
I like this reference to investment. One of the ironies of the library world is that while the creation of bibliographic data was historically central to the library mission and continues to be a major investment, we have not released its full value in systems and services. We have not made that data work as hard as we might, either in the context of the user experience or in terms of the management intelligence that can be mined from it (to support recommender systems, collection analysis, and so on).
And the NCSU catalog does indeed make much more use of the data. There are lots of nice things in it. At first sight, the Endeca faceted browse structure works well with the bibliographic data pivoting on topic, genre, era, format, region, library, language, and author. There is also a general subject browse. It would be interesting to know how users find the approach, although it may be familiar to some from sites like shopping.com. I liked the ‘send search to’ option, where you could repeat the search on other catalogs and search sytems. It would benefit from FRBRization and this is on the agenda for development. I am sure that there will be a lot of discussion about this over the next while, as people put it through its paces – Andrew Pace briefly reviews some features in a web4lib mail (and gives well deserved kudos to his colleague Emily Lynema who worked on the project).
It is interesting that they use circulation data for sorting, as a measure of ‘popularity’. (OCLC uses holdings counts – another type of ‘intentional‘ data.)
It is also interesting that NCSU has chosen this route. Coming fresh from reading the UC report, I was thinking about organizational structures and the role of the catalog in the context of the wider bibliographic database spectrum when I saw the NCSU catalog. NCSU has independently created an impressive resource which they will maintain and further develop. At some stage, I hope that they discuss the considerations that prompted them to go down this path. It would be interesting to know whether this is seen as an interim approach until the market catches up, or whether we are going to see more of this kind of approach in libraries. It would also be interesting to know whether others might benefit from the work done, either through some Endeca offering or in another way. Of course, the catalog covers a part only of the library collection. Would this approach be hospitable to other data (A&I resources for example, with their differently structured data). Over the next few years, a major issue for libraries will be thinking about how to erase some of the boundaries between databases and allow their users to prospect the full literature in easier ways. This will raise important issues about bibliographic structures and practices as the historical investment in cataloging and classification has focused on one part of the library collection. That said, it is really great seeing the data exercised in this way.
So, a significant report and a significant new catalog in one week. It is good to see this rethinking of the catalog, and the wider bibliographic apparatus, alongside the type of innovation in bringing the service to the user that Dave Pattern and others are exploring.
Full disclosure: I had been thinking about catalogs because I have been interviewed for two reports on catalogs and cataloging in the last while. One of them was the UC report discussed above. And of course, OCLC is discussed at various points in the UC report.
Some related entries: