metadata and XML services for libraries, publishers and digital repositories

support for libraries

collecting in the digital realm

  • ¥a selection of digital resources in support of collection development policy
  • ¥b harvesting metadata from digital repositories
  • ¥c "cross-walking" to other metadata formats; e.g., MARC21, MARCXML, MODS, Solr, Dublin Core, etc.
  • ¥d subject, author collection subsets from open access sources (such as the HathiTrust, PubMed Central, open access journal articles, etc.) encoded in MARC21, MODS, etc.
  • ¥e selections from HathiTrust collection of digitized books and serials...for more information.
  • ¥f MARC records for articles and journals in PubMed Central...for more information

global database editing

  • ¥a processing vendor metadata records: MARC21, MARCXML, ONIX for books, continuing resources
  • ¥b global editing for MARC record validation
  • ¥c global editing of bibliographic and authority records
  • ¥d bibliographic record merging, e.g., vendor records for electronic resources merged with existing records
  • ¥e moving content to new fields, subfields
  • ¥f preparing data for system migration: bibliographic, authority and patron records

electronic resource management (ERM)

  • ¥a html markup for A-Z lists of journal subscriptions with RSS links to current table of contents
  • ¥b RSS links to journal table of contents added to MARC records
  • ¥c transforming XML data:
    • ¥1 ONIX for serials (continuing resources) to MARC, other formats
    • ¥2 ONIX-PL (licensing) to html, MARC
    • ¥3 COUNTER (usage statistics) to HTML, PDF, CSV
    • ¥4 harvesting with SUSHI (protocol for accessing vendor usages statistics)

support for publishers

  • ¥a metadata encoding for publications
  • ¥b MARC21 and/or MARCXML
  • ¥c ONIX for books
  • ¥d ONIX for continuing resources (serials)
  • ¥e ONIX-PL for licensing encoding
  • ¥f enhancement of MARC/ONIX records with table of contents (field 505/d104) or summary (field 520/d101)
  • ¥g MODS, METS, Dublin Core, Qualified Dublin Core, others

support for digital repositories

  • ¥a metadata crosswalking: comma-, tab-delimited text files => XML => Dublin Core, QDC, DDI, MARC21, METS, MODS, etc.
  • ¥b metadata encoding
  • ¥c controlled vocabularies
  • ¥d classification codes, language and temporal encoding standards
  • ¥e metadata quality control
  • ¥f URL normalization
  • ¥g preservation related tasks

XML processing services

XML, the language of markup, is the format of choice for metadata encoding. Technologies related to XML processing include the XML Data Model, XPath and the eXtensible Stylesheet Language (XSL) and/or XQuery for transforming XML into other XML documents, HTML, PDF, RDF/XML, RSS, etc.

  • ¥a stylesheet authoring
  • ¥b transforming metadata content to standards such as dates (ISO 8601)
  • ¥c terminological control
  • ¥d converting other formats, e.g., MS Access, spreadsheets, Filemaker, CSV to XML
  • ¥e crosswalk XML files to metadata standards such as Dublin Core, MARC, MODS, etc.
  • ¥f resolving character encoding entity issues