OAI-PMH for Resource Harvesting

Similar documents
LECTURE 1.1: Enterprise Architecture

Abstract Our project analyzes the water use of apartment complexes in the City of Davis. We

REQUEST FOR PROPOSAL DESIGN-BUILD Request for Proposal No Campbell Creek Estuary Natural Area: Wildlife Overlooks and Rustic Fence

Introduction. Student Comments. Welcome to Sedley Court. High quality student accommodation

Certified International Property Specialist (CIPS) DESIGNATION APPLICATION

Collection 20. Title: NWHSU Collection of Professional Convention Materials

Legal Wing - Federation of Karnataka Chambers of Commerce and Industry (FKCCI), Bengaluru 15 th September, 2016 Article 99

Purpose. ARDI Registration Process

GRADUATE HOUSING CONTRACTS

MiFID II FAQs. For Advisers, Discretionary Portfolio Managers and Product Providers. Praemium Administration Limited

Reclaimed Land A guide for developers applying for an interest in reclaimed land under the Marine and Coastal Area Act 2011

The Bannister Team Prepared for: Compliments of:

General Information for Cooperative Housing Societies in Mumbai

Homeowners Guide To Assignment of Mortgage Payments Sales

APB Mission Statement:

LEGAL BRIEF FORECLOSURE ON RENTAL PROPERTY JANUARY 2016

Membership Fees 2018 Broker

Policy date October 2015 Document version Version 3 National Operations Manager Review date October 2018

City of Richmond Rent Control and Just Cause for Eviction. Fact Sheet

The role of the core geospatial information infrastructure in the protection and management of the natural and cultural heritage of Greece

MacroHomes Design Competition - Call for Submissions. Prize: $10,000 & potential collaboration on a large scale multifamily project

DRAFT 1. GENERAL INFORMATION 2. APPLICATION INSTRUCTIONS

Islington & Shoreditch Housing Association (ISHA) Relationship Breakdown Policy

TO LET. Town Centre Retail Premises. Shop Unit 10, Manchester Chambers, Oldham OL1 1LF.

TENANCY APPLICATION GUIDE TO COMPLETION

ACEP-ALE Program TIP SHEET FOR MICHIGAN APPLICANTS

FACT SHEET Residential, Business, and Wind & Solar Resource Leasing on Indian Land Final Rule

Barnes Walker, Chartered 3119 Manatee Avenue West, Bradenton, Florida Ph: (941) ; F: (941) SORTING OUT SHORT SALES:

DESCRIPTION STANDARDS, OBJECTIVES, AND INDICATORS REAL ESTATE (411) STUDENTS WILL UNDERSTAND THE ECONOMICS OF THE REAL ESTATE INDUSTRY.

Plenary three: How to calculate and apportion service charges effectively

BOUNDARY LINE ADJUSTMENT

JACKSON DE CARVALHO, Ph.D.

PROPOSAL Architectural Services

Single Drawer Accountability (SDA) (NON POS) R2 and R4 Hour Offices Conversion Guide Instructions

GOLDEN ISLES ASSOCIATION OF REALTORS 2019 CIRCLE OF EXCELLENCE APPLICATION FORM

ICN Merger Working Group. Effective Remedies. 16 February 2017 Washington DC

THE PROCESS OF PURCHASE OF A PROPERTY IN SPAIN

MOTION NO. M Beacon Hill Station TOD Property Final Transaction Agreements PROPOSED ACTION

LEVEL 6 UNIT 17 - CONVEYANCING SUGGESTED ANSWERS JUNE 2011

BUYER HANDBOOK. my purpose. I provide high-end service for the Nashville area home buyer.

SHORT TERM RENTAL REGISTRATION APPLICATION. All sections are required to be completed; please apply through portal or print.

Creative City Interior Design and 3D Visualization

APPLICATION. Fee Simple Subdivision Bare Land Strata Conversion of Existing Building into Strata Units

Applications Skill Checks

W yoming M ultiple L isting S ervice

Strategic Planning for RAD Conversions. Thursday, April 6, 2017

FACT SHEET # 32 EVICTION. Introduction

TO LET. Town Centre Offices Flexible Terms. First & Second Floor Offices, Manchester Chambers, Oldham OL1 1LF.

Representative Alissa Keny-Guyer, Chair House Committee on Human Services and Housing Oregon State Legislature 900 Court Street Salem, OR 97301

The Corporation of the City of Stratford

Strategies for Funding Farmland Preservation

Nassau County Department of Planning & Economic Opportunity Nassau Place Yulee, Florida 32097

City of Surrey ADDITIONAL PLANNING COMMENTS File:

Greenlane Staff Residence Information o

Room Selection FAQ s

Procedure for an Permit Application For a Park Model Home or Mobile Home

Registrar of Real Property (RoRP) Client Handbook

Dr Kanishka Karunasena

Opportunity Description: Assistant Property Manager Location: Office in Menlo Park, CA

Ohio Department of Transportation Testimony to the Judiciary Committee of the Ohio House on House Bill 5 (Eminent Domain)

221 East 11 th Street Austin, Texas 78701

MAP REFERENCE HANDBOOK

Barnstable Municipal Airpolt Commission. Barnstable Municipal Airport Improvements Project Barnstable Municipal Airport

SHORELINE ALTERATION/DREDGE AND FILL PERMIT APPLICATION

How to Make Auctions of Public Land

STAFF REPORT - SUMMATION. South Park Estates 9 th Filing Final Plat Process. CASE NUMBER(s): UDC SUBDIVISION CODE: SPKE 09

Lessor Presentation & Disclosure Requirements

Approval to Sell Property at Glenmont Station to Montgomery County

Habitat for Humanity Greater San Francisco Application Meeting: 1009 Mission Street, San Francisco

Module Three - Application of the Method The Seven-Step Method Detail of Measurement Key Points to Remember...

PRE-DESIGN STUDY 6/26/ COLE PLACE ARSHIA ARCHITECTS 550 N LARCHMONT BLVD #100 LOS ANGELES, CA

Site Modification Process for Alcova Reservoir and Pathfinder Reservoir, Natrona County, Wyoming Leaseholders Revised October 6, 2016

FY18 DCS Grant Round Announced

Board of Regents Meeting November 30-December 1, 2006 Agenda Item #32 Arizona State University EXECTIVE SUMMARY Page 1 of 8

APPLICATION DEADLINE PINNACLE AWARDS CELEBRATION THURSDAY JANUARY 17, 2019, 5:00 PM THURSDAY. MARCH 21, 2019 Carlos Hellenic Center Ballroom

RETENTION AND DISPOSAL APPRAISAL REPORT Te Kooti Whenua Māori Māori Land Court Te Kooti Pīra Māori Māori Appellate Court Court records

KBKG Tax Insight: Retail/Restaurant Industry Safe Harbor Under Tangible Property Regulations

MANUFACTURED HOUSING. TIB is excited to announce that we are now accepting CONVENTIONAL CONFORMING MANUFACTURED HOUSING Loans

Zeynep Başaran Bundur CURRENT POSITION EDUCATION

TERMINATION OF TENANCIES FOR TENANT DEFAULT RESULTS OF FORFEITURE OF LEASES QUESTIONNAIRE

March 17, 2015 RICHLAND COUNTY RIGHT OF WAY POLICY

Rental Assistance Demonstration Closing Overview & Checklist: Project Based Voucher (PBV) Conversions

Staff Report. Andrea Ouse, Director of Community and Economic Development

Mid-state association of realtors, inc. The Association with the Personal Connection 73 East Main St. Plainville, CT 06062

University of Alberta: Don Hickey, Anastasia Lim (Chair), Emily Ball, Ben Louie, Pat Jansen, Doug Dawson

ACCESSORY DWELLING UNIT ORDINANCE

WETLANDS PERMIT APPLICATION INFORMATION PACKET

NORTHERN VIRGINIA REGIONAL COMMISSION. Minutes of the Commission Meeting Held Thursday, November 29, 2018

4 LIHTC ONLY, WITH AT LEAST 8 YEARS OF THE ORIGINAL 15-YEAR IRS COMPLIANCE PERIOD REMAINING (AKA NEW LIHTC)

Real Estate Fundamentals (90 Hours)

Pinnacle Award Rules. [Type the document subtitle] 2018 Event. Pinnacle Award Rules. DeKalb Association of REALTORS

HOW TO SUCCEED AT YOUR MANAGEMENT AND OCCUPANCY REVIEW (MOR)

Commissioner Anthony arrived after official Roll Call at 7:20 p.m.

ADVANCED SEMINAR IN ARCHIVES AND RECORDS MANAGEMENT

Real Estate Fundamentals 60 Hour Mississippi Broker Pre-Licensing

Implementing the New Lease Accounting Standard

Paul D. Ralph, BES, RPP, MCIP, Commissioner, Development Services Department

Studio $3875 $3875 $7, bedroom single $4125 $4125 $ bedroom double $3200 $3200 $6400

Video Course Evaluation Form. Atty ID number for Pennsylvania: Name of Course You Just Watched

Myra / Manzanita Neighborhood Coalition 939 Manzanita Street Los Angeles, California

Transcription:

OAI-PMH fr Resurce Harvesting Herbert Van de Smpel Digital Library Research & Prttyping Team Research Library, Ls Alams Natinal Labratry Michael Nelsn Cmputer Science Department Old Dminin University OAI-PMH fr Resurce Harvesting Tutrial OAI4, Octber 20 th 2005, CERN, Geneva, Switzerland

Tutrial Outline OAI-PMH fr Resurce Harvesting: prblem statement and cnceptual slutin MPEG-21 DIDL: An XML-based Cmplex Object Frmat fr OAI-PMHbased Resurce Harvesting Accurate mirrring the cllectin f the American Physical Sciety using OAI-PMH-based Resurce Harvesting md_ai: An OAI-PMH-based mdel fr Web Resurce Harvesting OAIResurce: A sftware tl fr OAI-PMH-based Resurce Harvesting OAI-PMH fr Resurce Harvesting Tutrial OAI4, Octber 20 th 2005, CERN, Geneva, Switzerland

Resurce Harvesting: Use cases Discvery: use cntent itself in the creatin f services search engines that make full-text searchable citatin indexing systems that extract references frm the full-text cntent brwsing interfaces that include thumbnail versins f high-quality images frm cultural heritage cllectins Preservatin: peridically transfer digital cntent frm a data repsitry t ne r mre trusted digital repsitries trusted digital repsitries need a mechanism t autmatically synchrnize with the riginating data repsitry OAI-PMH fr Resurce Harvesting Tutrial OAI4, Octber 20 th 2005, CERN, Geneva, Switzerland

Resurce Harvesting: Use cases Discvery: Institutinal Repsitry & Digital Library Prjects: UK JISC, DARE, DINI Web search engines: cmpetitin fr cntent (cf Ggle Schlar) Preservatin: Institutinal Repsitry & Digital Library Prjects: UK JISC, DARE, DINI Library f Cngress: NDIIP Archive Exprt/Ingest, e-depsit OAI-PMH is well-established. Can OAI-PMH be used fr Resurce Harvesting? OAI-PMH fr Resurce Harvesting Tutrial OAI4, Octber 20 th 2005, CERN, Geneva, Switzerland

Existing OAI-PMH based appraches Typical scenari: 1. An OAI-PMH harvester harvests Dublin Cre recrds frm the OAI-PMH repsitry. 2. The harvester analyzes each Dublin Cre recrd, extracting dc.identifier infrmatin in rder t determine the netwrk lcatin f the described resurce. 3. A separate prcess, ut-f-band frm the OAI-PMH, cllects the described resurce frm its netwrk lcatin. OAI-PMH fr Resurce Harvesting Tutrial OAI4, Octber 20 th 2005, CERN, Geneva, Switzerland

Existing OAI-PMH based appraches : Issue 1 Lcating the resurce based n infrmatin prvided in dc.identifier dc.identifier used t cnvey a variety f identifier: (simultaneusly) URL DOI, bibligraphic citatin, Nt expressive enugh t distinguish between identifier, lcatr. Several derferencing attempts required URI prvided in dc.identifier is cmmnly that f a bibligraphic splash page Hw t knw it is a bibligraphic splash page, nt the resurce? If it is a bibligraphic splash page, where is the resurce? OAI-PMH fr Resurce Harvesting Tutrial OAI4, Octber 20 th 2005, CERN, Geneva, Switzerland

Existing OAI-PMH based appraches : Issue 2 Using the OAI-PMH datestamp f the Dublin Cre recrd t trigger incremental harvesting: Datestamp f DC recrd des nt necessarily change when resurce changes DC recrd datestamp n change DC recrd datestamp change n resurce update resurce update n metadata update OK missed resurce update metadata update unnecessary resurce dwnlad OK OAI-PMH fr Resurce Harvesting Tutrial OAI4, Octber 20 th 2005, CERN, Geneva, Switzerland

Existing OAI-PMH based appraches : Cnventins Cnventins address Issue 1; Issue 2 can nt really be addressed. First dc.identifier is lcatr f the resurce what if the resurce is nt digital? Use f dc.frmat and/r dc.relatin t cnvey lcatr OAI-PMH fr Resurce Harvesting Tutrial OAI4, Octber 20 th 2005, CERN, Geneva, Switzerland

Existing OAI-PMH based appraches : Cnventins <ai_dc:dc> <dc:title>a Simple Parallel-Plate Resnatr Technique fr Micrwave. Characterizatin f Thin Resistive Films</dc:title> <dc:creatr>vrbiev, A.</dc:creatr> <dc:subject>ing-inf/01 Elettrnica</dc:subject> <dc:descriptin>a parallel-plate resnatr methd is prpsed fr nn-destructive characterisatin f resistive films used in micrwave integrated circuits. A slt made in ne... </dc:descriptin> <dc:publisher>micrwave engineering Eurpe</dc:publisher> <dc:date>2002</dc:date> <dc:type>dcument relativ ad una Cnferenza altr Event</dc:type> <dc:type>peerreviewed</dc:type> <dc:identifier>http://amsacta.cib.unib.it/archive/00000014/</dc:identifier> <dc:frmat>pdf http://amsacta.cib.unib.it/archive/00000014/01/gaas_1_vrbiev.pdf </dc:frmat> </ai_dc:dc> splash page lcatr f resurce OAI-PMH fr Resurce Harvesting Tutrial OAI4, Octber 20 th 2005, CERN, Geneva, Switzerland

Existing OAI-PMH based appraches : Cnventins <dc:identifier>http://amsacta.cib.unib.it/archive/00000014/</dc:identifier> <dc:relatin> http://amsacta.cib.unib.it/archive/00000014/01/gaas_1_vrbiev.pdf </dc:relatin> splash page lcatr f resurce OAI-PMH fr Resurce Harvesting Tutrial OAI4, Octber 20 th 2005, CERN, Geneva, Switzerland

Existing OAI-PMH based appraches : Cnventins <dc:identifier>http://amsacta.cib.unib.it/archive/00000014/</dc:identifier> <dc:relatin> http://reslver.unib.it/00000014/ </dc:relatin> <dc:relatin> http://amsacta.cib.unib.it/archive/00000014/01/gaas_1_vrbiev.pdf </dc:relatin> splash page splash page lcatr f resurce OAI-PMH fr Resurce Harvesting Tutrial OAI4, Octber 20 th 2005, CERN, Geneva, Switzerland

Existing OAI-PMH based appraches : Other attempts dc.identifier leads t splash page & splash page cntains special purpse XHTML link t resurce(s) What if there is n splash page? Hw des a harvester knw he is in this situatin? OA-X: prtcl extensin OK in lcal cntext Strategic prblem t generalize Hw t cnslidate with OAI-PMH data mdel Qualified Dublin Cre Culd bring expressiveness t distinguish between lcatr & identifier But what with datestamp issue? OAI-PMH fr Resurce Harvesting Tutrial OAI4, Octber 20 th 2005, CERN, Geneva, Switzerland

Prpsed OAI-PMH based apprach Use metadata frmats that were specifically created fr representatin f digital bjects: Cmplex Object Frmats as OAI-PMH metadata frmats MPEG-21 DIDL, METS,.. OAI-PMH fr Resurce Harvesting Tutrial OAI4, Octber 20 th 2005, CERN, Geneva, Switzerland

OAI-PMH data mdel resurce OAI-PMH identifier = entry pint t all recrds pertaining t the resurce item metadata pertaining t the resurce Dublin Cre MARCXML metadata metadata recrds simple mre expressive highly expressive highly expressive OAI-PMH fr Resurce Harvesting Tutrial OAI4, Octber 20 th 2005, CERN, Geneva, Switzerland

Cmplex Object Frmats : characteristics Representatin f a digital bject by means f a wrapper XML dcument Represented resurce can be: simple digital bject (cnsisting f a single datastream) cmpund digital bject (cnsisting f multiple datastreams) Unambiguus apprach t cnvey identifiers f the digital bject and its cnstituent datastreams Include datastream: By-Value: embedding f base64-encded datastream By-Reference: embedding netwrk lcatin f the datastream nt mutually exclusive; equivalent Include a variety f secndary infrmatin By-Value By-Reference Descriptive metadata, rights infrmatin, technical metadata, OAI-PMH fr Resurce Harvesting Tutrial OAI4, Octber 20 th 2005, CERN, Geneva, Switzerland

<didl:didl> <didl:item> <didl:descriptr><didl:statement mimetype="text/xml; charset=utf-8"> <dii:identifier> http://amsacta.cib.unib.it/archive/00000014/ </dii:identifier> </didl:statement></didl:descriptr> <didl:descriptr><didl:statement mimetype="text/xml; charset=utf-8"> <ai_dc:dc> <dc:title>a Simple Parallel-Plate Resnatr Technique fr Micrwave. Characterizatin f Thin Resistive Films </dc:title> <dc:creatr>vrbiev, A.</dc:creatr> <dc:identifier> http://amsacta.cib.unib.it/archive/00000014/</dc:identifier> <dc:frmat>applicatin/pdf</dc:frmat> </ai_dc:dc> </didl:statement></didl:descriptr> <didl:cmpnent> <didl:resurce mimetype="applicatin/pdf" ref="http://amsacta.cib.unib.it/archive/00000014/01/gaas_1_vrbiev.pdf"/> </didl:cmpnent> </didl:item> </didl:didl> OAI-PMH fr Resurce Harvesting Tutrial OAI4, Octber 20 th 2005, CERN, Geneva, Switzerland

Cmplex Object Frmats & OAI-PMH Resurce represented via XML wrapper => OAI-PMH <metadata> Unifrm slutin fr simple & cmpund bjects Unambiguus expressin f lcatr f datastream Disambiguatin between lcatrs & identifiers OAI-PMH datestamp changes whenever the resurce (datastreans, secndary infrmatin) changes OAI-PMH semantics apply: abut cntainers, set membership OAI-PMH fr Resurce Harvesting Tutrial OAI4, Octber 20 th 2005, CERN, Geneva, Switzerland

OAI-PMH based apprach using Cmplex Object Frmat Typical scenari: 1. An OAI-PMH harvester checks fr supprt f a cmplex bject frmat using the ListMetadataFrmats verb 2. The harvester harvests the cmplex bject metadata. Semantics f the OAI-PMH datestamp guarantee that new and mdified resurces are detected. 3. A parser at the end f the harvesting applicatin analyzes each harvested cmplex bject recrd: - The parser extracts the bitstreams that were delivered By-Value. - The parser extracts the unambiguus references t the netwrk lcatin f bitstreams delivered By-Reference. 4. A separate prcess, ut-f-band frm the OAI-PMH, cllects the bitstreams delivered By-Reference frm the extracted netwrk lcatins. OAI-PMH fr Resurce Harvesting Tutrial OAI4, Octber 20 th 2005, CERN, Geneva, Switzerland

Cmplex Object Frmats & OAI-PMH : existing implementatins LANL Repsitry Lcal strage f Terrabytes f schlarly assets Assets stred as MPEG-21 DIDL dcuments DIDL dcuments made accessible t dwnstream applicatins via the OAI-PMH Mirrring f American Physical Sciety cllectin at LANL Maps APS dcument mdel t MPEG-21 DIDL Transfer Prfile Expses MPEG-21 DIDL dcuments thrugh OAI-PMH infrastructure Inlcudes digests/signatures DSpace & Fedra plug-ins md_ai Maps DSpace/Fedra dcument mdel t MPEG-21 DIDL Transfer Prfile Expses MPEG-21 DIDL dcuments thrugh OAI-PMH infrastructure OAI-PMH fr Resurce Harvesting Tutrial OAI4, Octber 20 th 2005, CERN, Geneva, Switzerland

Cmplex Object Frmats & OAI-PMH : issues Which Cmplex Object Frmat(s) Hw t Prfile Cmplex Object Frmat(s) fr OAI-PMH Harvesting Large recrds Cmpund bjects with multiple datastreams. What if nly 1 datastream gets updated? Because the resurce is represented as <metadata>, can rights pertaining t the resurce be expressed accrding t the rights fr metadata OAI-rights guideline? Tls: Sftware library t write cmpliant cmplex bjects Integratin f this library with repsitry systems (Fedra, DSpace, eprints.rg,.) Sftware t harvest resurces based n OAI-PMH mdel OAI-PMH fr Resurce Harvesting Tutrial OAI4, Octber 20 th 2005, CERN, Geneva, Switzerland

Readings Herbert Van de Smpel, Michael Nelsn, Carl Lagze, Simen Warner. Resurce Harvesting witin the OAI-PMH Framewrk. D-Lib Magazine. December 2004. http://dx.di.rg/10.1045/december2004-vandesmpel OAI-PMH fr Resurce Harvesting Tutrial OAI4, Octber 20 th 2005, CERN, Geneva, Switzerland