elag 2005. cern, geneva, mercredi 1 er juin 2005 infoscience.epfl.ch i nfoscience epfls...

24
ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005 infoscience.epfl. ch Infoscience EPFL’s Institutional Repository … and much more 1. Objectives 2. Means 3. Content & Services 4. Infoscience vs OAI 5. PhDTheses@epfl 6. Next steps forward David Aymonin Directeur de l’Information Scientifique et des Bibliothèques

Upload: elian-hurlbut

Post on 29-Mar-2015

212 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005 infoscience.epfl.ch I nfoscience EPFLs Institutional Repository … and much more 1.Objectives 2.Means 3.Content

ELAG 2005. CERN, Geneva, Mercredi 1er juin 2005

infoscience.epfl.ch

InfoscienceEPFL’s Institutional Repository

… and much more

1. Objectives

2. Means

3. Content & Services

4. Infoscience vs OAI

5. PhDTheses@epfl

6. Next steps forwardDavid Aymonin

Directeur de l’InformationScientifique et des

Bibliothèques

Page 2: ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005 infoscience.epfl.ch I nfoscience EPFLs Institutional Repository … and much more 1.Objectives 2.Means 3.Content

ELAG 2005. CERN, Geneva, Mercredi 1er juin 2005

infoscience.epfl.ch

Page 3: ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005 infoscience.epfl.ch I nfoscience EPFLs Institutional Repository … and much more 1.Objectives 2.Means 3.Content

ELAG 2005. CERN, Geneva, Mercredi 1er juin 2005

infoscience.epfl.ch

• Collect and make known the EPFL intellectual heritage, i.e. its scientific and teaching output

• Make researchers and their skills more visible

• Make the scientific data collected more accessible and legible,by structuring them

• Allow their long term preservation

• Allow their processing for the assessment needs of the institution

Objectives

Page 4: ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005 infoscience.epfl.ch I nfoscience EPFLs Institutional Repository … and much more 1.Objectives 2.Means 3.Content

ELAG 2005. CERN, Geneva, Mercredi 1er juin 2005

infoscience.epfl.ch

• Human resources– Full-time project leader, new position– Ad-hoc team : EPFL staff, when needed

• Technical choice– Based on CDSWare, XMLMARC, Python

language– Official partnership with CERN for CDSWare

software development

Means

Page 5: ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005 infoscience.epfl.ch I nfoscience EPFLs Institutional Repository … and much more 1.Objectives 2.Means 3.Content

ELAG 2005. CERN, Geneva, Mercredi 1er juin 2005

infoscience.epfl.ch

ThesesScientific outputs People@epflUnion catalogue

Infoscience

755 profiles of

researchers

40 000 references.11 libraries *

Content, 1st june 2005

Searches : 793/day, ± 24 000/monthExports : 188/day, ± 5 600/monthFulltexts : 114/day, ± 3 500/month

4 799 references1 500 fulltext

26 laboratories

3 274 references733 fulltext

* : 400 000 references from 11 EPFL libraries, members of NEBIS Union catalogue will be added on october 2005

Page 6: ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005 infoscience.epfl.ch I nfoscience EPFLs Institutional Repository … and much more 1.Objectives 2.Means 3.Content

ELAG 2005. CERN, Geneva, Mercredi 1er juin 2005

infoscience.epfl.ch

Services : Data export

Page 7: ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005 infoscience.epfl.ch I nfoscience EPFLs Institutional Repository … and much more 1.Objectives 2.Means 3.Content

ELAG 2005. CERN, Geneva, Mercredi 1er juin 2005

infoscience.epfl.ch

Old version. LANOS Lab. Website

Localdatabase

Services : Data re-use

Page 8: ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005 infoscience.epfl.ch I nfoscience EPFLs Institutional Repository … and much more 1.Objectives 2.Means 3.Content

ELAG 2005. CERN, Geneva, Mercredi 1er juin 2005

infoscience.epfl.ch

infoscience

New version. LANOS Lab. Website

Services : Data re-use

Page 9: ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005 infoscience.epfl.ch I nfoscience EPFLs Institutional Repository … and much more 1.Objectives 2.Means 3.Content

ELAG 2005. CERN, Geneva, Mercredi 1er juin 2005

infoscience.epfl.ch

Services : People@epfl

Old version. LANOS Lab. Directory

Page 10: ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005 infoscience.epfl.ch I nfoscience EPFLs Institutional Repository … and much more 1.Objectives 2.Means 3.Content

ELAG 2005. CERN, Geneva, Mercredi 1er juin 2005

infoscience.epfl.ch

New version. LANOS Lab. Directory

Services : People@epfl

Page 11: ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005 infoscience.epfl.ch I nfoscience EPFLs Institutional Repository … and much more 1.Objectives 2.Means 3.Content

ELAG 2005. CERN, Geneva, Mercredi 1er juin 2005

infoscience.epfl.ch

• Freely accessible PhD theses are declared in OAIster

• Already tried to declare Infoscience in Scirus, Google scholar… Not as simple as it should be

• Next : ISI Web citation index http://scientific.thomson.com/news/newsletter/2005-02/8264025/

• Regarding OA, EPFL attitude is « moderate »– Variable from one lab. to another, Open access still frightens– Advocacy OAI will be done in 2005– Official statement from Conférence des Universités Suisses

awaited and could help

Infoscience & OAI

Page 12: ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005 infoscience.epfl.ch I nfoscience EPFLs Institutional Repository … and much more 1.Objectives 2.Means 3.Content

ELAG 2005. CERN, Geneva, Mercredi 1er juin 2005

infoscience.epfl.ch

• 1920 Paper archiving at the Central library. 3000 PhD theses

• 2003 Electronic archiving made possible

• June 2004 Electronic archiving compulsory. 200 PhD theses

/year

• Retrodigitalization of all PhD theses, started end 2004

600 000 pages.300 dpi, B&W or 150 dpi, grey levels for color pagesTIFF provided, PDF 1.4 image onlineOCR for liminary pages of 2000-2004 thesesTotal amount of data 15 Gb

PhDTheses@epfl

Page 13: ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005 infoscience.epfl.ch I nfoscience EPFLs Institutional Repository … and much more 1.Objectives 2.Means 3.Content

ELAG 2005. CERN, Geneva, Mercredi 1er juin 2005

infoscience.epfl.ch

Workflow , PhDTheses@epfl

Central Library EPFL

Print versionN copies

PDF file

THESE File FINAL VERSIONPDF or Postscript

Printing service EPFL

Libraries :National,

ETHZ Swiss National Library

Asking for Authorisation to put PhD these on the Internet, if NOT, then on Intranet

File processing,Putting online

Academic registration

service EPFL

Information: Final version released

PhD Student

Metadata

Page 14: ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005 infoscience.epfl.ch I nfoscience EPFLs Institutional Repository … and much more 1.Objectives 2.Means 3.Content

ELAG 2005. CERN, Geneva, Mercredi 1er juin 2005

infoscience.epfl.ch

OAIenable

d

Metadata processing, PhDTheses@epfl

Cataloguing in NEBIS

Filemaker Web

database

Abstract Fulltext

2 Catalogue

s (unfortuna

tely)

DTD REROData enrichme

nt

INFOSCIENCECDSWare

copy and

paste

Loading XML

recordsWith

abstractsAbstract

OAI-PMH

Link to Filemaker web record

Page 15: ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005 infoscience.epfl.ch I nfoscience EPFLs Institutional Repository … and much more 1.Objectives 2.Means 3.Content

ELAG 2005. CERN, Geneva, Mercredi 1er juin 2005

infoscience.epfl.ch

Made by the central library for each theseCreation of final

• Frontpage (in PDF)• Abstract (in PDF + HTML)• TOC (in PDF)

Optimization of heavy PDF filesSecurity

• Modification not allowed• Printing, searching , copy allowed

File metadata• Size of file, links to abstracts and fulltext, • Number of pages

PDF files processing, PhDTheses@epfl

Page 16: ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005 infoscience.epfl.ch I nfoscience EPFLs Institutional Repository … and much more 1.Objectives 2.Means 3.Content

ELAG 2005. CERN, Geneva, Mercredi 1er juin 2005

infoscience.epfl.ch

• Main issues– Intellectual Property Rights (IPR)– Metadata and file formats– Harvesting and visibility– Master dissertations

The future is now, PhDTheses@epfl

Page 17: ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005 infoscience.epfl.ch I nfoscience EPFLs Institutional Repository … and much more 1.Objectives 2.Means 3.Content

ELAG 2005. CERN, Geneva, Mercredi 1er juin 2005

infoscience.epfl.ch

• IPR– Belongs to the Author– Electronic archiving made compulsory upon

student registration– Changes in swiss law in 2005

Putting theses on the InternetIs not considered as prior publication

The future is now, PhDTheses@epfl

Page 18: ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005 infoscience.epfl.ch I nfoscience EPFLs Institutional Repository … and much more 1.Objectives 2.Means 3.Content

ELAG 2005. CERN, Geneva, Mercredi 1er juin 2005

infoscience.epfl.ch

• Metadata and file format– Metadata

• Swiss « DTD » ? (Convention in 2003)

• DTD-MS from NDLTD ? (already exists)

• TEF from AFNOR ? (awaited for 2005)

– PDF• PDF/A ? (ISO Standard in 2005?)

Standards could appear in 2005

The future is now, PhDTheses@epfl

Page 19: ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005 infoscience.epfl.ch I nfoscience EPFLs Institutional Repository … and much more 1.Objectives 2.Means 3.Content

ELAG 2005. CERN, Geneva, Mercredi 1er juin 2005

infoscience.epfl.ch

• Harvesting and visibility– OAIster

• RERO, EPFL, ETHZ already included• Good Search interface• Not specific to theses

– Should we join NDLTD ?• 195 members : UK, B, S, D, E, CN, AU,

China, India• Very poor Search interface, for the moment

– European Thesis On Line (ETOL, Europe)• Just at its begining

Militate in favour ofSwitzerland member of NDLTD

The future is now, PhDTheses@epfl

Page 20: ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005 infoscience.epfl.ch I nfoscience EPFLs Institutional Repository … and much more 1.Objectives 2.Means 3.Content

ELAG 2005. CERN, Geneva, Mercredi 1er juin 2005

infoscience.epfl.ch

If needed, more info about:

Page 21: ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005 infoscience.epfl.ch I nfoscience EPFLs Institutional Repository … and much more 1.Objectives 2.Means 3.Content

ELAG 2005. CERN, Geneva, Mercredi 1er juin 2005

infoscience.epfl.ch

If needed, more info about:

Page 22: ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005 infoscience.epfl.ch I nfoscience EPFLs Institutional Repository … and much more 1.Objectives 2.Means 3.Content

ELAG 2005. CERN, Geneva, Mercredi 1er juin 2005

infoscience.epfl.ch

If needed, more info about:

Page 23: ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005 infoscience.epfl.ch I nfoscience EPFLs Institutional Repository … and much more 1.Objectives 2.Means 3.Content

ELAG 2005. CERN, Geneva, Mercredi 1er juin 2005

infoscience.epfl.ch

• Master dissertations– The next step– Will require along and difficult institutional

agreement to bep fully set up– In each of the 12 academic sections of EPFL

collaboration with librarians, who are in contact with the teachers

The Infoscience tool is robustAnd allows us to work with voluntary

people !

The future is now, PhDTheses@epfl

Page 24: ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005 infoscience.epfl.ch I nfoscience EPFLs Institutional Repository … and much more 1.Objectives 2.Means 3.Content

ELAG 2005. CERN, Geneva, Mercredi 1er juin 2005

infoscience.epfl.ch

Long live OAI !

Thank you for your attention

http://infoscience.epfl.ch

[email protected]