db pearson
Master of Library and Information Sciences
dbpearsonMLIS.com
about me
libraries
metadata harvesting; cataloging; authority control; metadata encoding; electronic resource management; XML processing
publishers
metadata encoding: MARC, MARCXML, ONIX books and serials, ONIX-PL, XML processing
digital repositories
metadata encoding: Dublin Core, Qualified Dublin Core, METS, PREMIS, VRA, MIX, more; preservation activities, training
contact info
for complete contact information
metadata and XML services for libraries, publishers and digital repositories
support for libraries
collecting in the digital realm
- ¥a selection of digital resources in support of collection development policy
- ¥b harvesting metadata from digital repositories
- ¥c "cross-walking" to other metadata formats; e.g., MARC21, MARCXML, MODS, Solr, Dublin Core, etc.
- ¥d subject, author collection subsets from open access sources (such as the HathiTrust, PubMed Central, open access journal articles, etc.) encoded in MARC21, MODS, etc.
- ¥e selections from HathiTrust collection of digitized books and serials...for more information.
- ¥f MARC records for articles and journals in PubMed Central...for more information
global database editing
- ¥a processing vendor metadata records: MARC21, MARCXML, ONIX for books, continuing resources
- ¥b global editing for MARC record validation
- ¥c global editing of bibliographic and authority records
- ¥d bibliographic record merging, e.g., vendor records for electronic resources merged with existing records
- ¥e moving content to new fields, subfields
- ¥f preparing data for system migration: bibliographic, authority and patron records
electronic resource management (ERM)
- ¥a html markup for A-Z lists of journal subscriptions with RSS links to current table of contents
- ¥b RSS links to journal table of contents added to MARC records
- ¥c transforming XML data:
- ¥1 ONIX for serials (continuing resources) to MARC, other formats
- ¥2 ONIX-PL (licensing) to html, MARC
- ¥3 COUNTER (usage statistics) to HTML, PDF, CSV
- ¥4 harvesting with SUSHI (protocol for accessing vendor usages statistics)
support for publishers
- ¥a metadata encoding for publications
- ¥b MARC21 and/or MARCXML
- ¥c ONIX for books
- ¥d ONIX for continuing resources (serials)
- ¥e ONIX-PL for licensing encoding
- ¥f enhancement of MARC/ONIX records with table of contents (field 505/d104) or summary (field 520/d101)
- ¥g MODS, METS, Dublin Core, Qualified Dublin Core, others
support for digital repositories
- ¥a metadata crosswalking: comma-, tab-delimited text files => XML => Dublin Core, QDC, DDI, MARC21, METS, MODS, etc.
- ¥b metadata encoding
- ¥c controlled vocabularies
- ¥d classification codes, language and temporal encoding standards
- ¥e metadata quality control
- ¥f URL normalization
- ¥g preservation related tasks
XML processing services
XML, the language of markup, is the format of choice for metadata encoding. Technologies related to XML processing include the XML Data Model, XPath and the eXtensible Stylesheet Language (XSL) and/or XQuery for transforming XML into other XML documents, HTML, PDF, RDF/XML, RSS, etc.
- ¥a stylesheet authoring
- ¥b transforming metadata content to standards such as dates (ISO 8601)
- ¥c terminological control
- ¥d converting other formats, e.g., MS Access, spreadsheets, Filemaker, CSV to XML
- ¥e crosswalk XML files to metadata standards such as Dublin Core, MARC, MODS, etc.
- ¥f resolving character encoding entity issues