Updated May 2024 by Armaan Kalkat
INTRODUCTION
The purpose of this project is to ensure that publisher information on items in the Baldwin collection is reflected in the correct place in the metadata and is not duplicated elsewhere on the record, especially in the creator field. The reason publishers often show up in the creator field on UFDC has to do with the way the cataloging was done on these items and how Sobek ingests OCLC records to populate the UFDC metadata. Publishers are often added as additional corporate bodies under a 710 MARC field as well as the traditional 260 or 264 MARC fields on the MARC record. UFDC maps the entries in the 710 field to creators and the 260 or 264 to publisher, meaning the publisher can be listed twice on the record. This contributes to overloaded records and can make it more difficult to tell who was involved in what aspects of an item’s production. This LibGuide is primarily intended as an internal document for metadata staff working with the UFDC but is also available publicly to increase transparency of our workflows and activities.
SUMMARY OF STEPS
For more detailed information, click "Steps" in the above menu.
METHODS USED
The browse by creator list uses the tag "&" for the ampersand sign while the publisher list uses the ampersand. Since we will be comparing the two lists, use Excel's find and replace to replace "&" with "&". This also improves readability.
The following resources are often helpful when researching Baldwin publishers:
Excel Formula for finding matches between two columns:
=IF(COUNTIF($[publisherColumn]:$[publisherColumn], $[creatorColumn]2)=0, "", "MATCH"
Replace [publisherColumn] and [creatorColumn] with the respective column code in your spreadsheet, e.g. ($I:$I, $B2) and drag down the column to apply formula to all rows; this assumes your top row is for column names and your list starts on row 2.