Transcript Infoscience

infoscience.epfl.ch

1.

2.

3.

4.

5.

6.

I

nfoscience

EPFL’s Institutional Repository … and much more

Objectives Means Content & Services Infoscience vs OAI PhDTheses@epfl Next steps forward David Aymonin Directeur de l’Information Scientifique et des Bibliothèques ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005

ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005 infoscience.epfl.ch

Objectives infoscience.epfl.ch

Collect and make known the EPFL intellectual heritage , i.e. its scientific and teaching output

Make researchers and their skills more visible

Make the scientific data collected more accessible and legible ,by structuring them

Allow their long term preservation

Allow their processing the institution for the assessment needs of ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005

Means infoscience.epfl.ch

Human resources

– Full-time project leader , new position – Ad-hoc team : EPFL staff, when needed •

Technical choice

– Based on CDSWare , XMLMARC , Python language – Official partnership with CERN software development for CDSWare ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005

Content, 1st june 2005 Searches : Exports : Fulltexts : 793/day, ± 24 000/month 188/day, ± 114/day, ± 5 600/month 3 500/month infoscience.epfl.ch

Infoscience Scientific outputs Theses People@epfl Union catalogue 4 799 references 1 500 fulltext 26 laboratories 3 274 references 733 fulltext 755 profiles of researchers 40 000 references.

11 libraries * * : 400 000 references from 11 EPFL libraries, members of NEBIS Union catalogue will be added on october 2005 ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005

Services : Data export infoscience.epfl.ch

ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005

Services : Data re-use infoscience.epfl.ch

Old version. LANOS Lab. Website

Local database ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005

Services : Data re-use infoscience.epfl.ch

New version. LANOS Lab. Website

infoscience ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005

Services : People@epfl infoscience.epfl.ch

Old version. LANOS Lab. Directory

ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005

Services : People@epfl infoscience.epfl.ch

New version. LANOS Lab. Directory

ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005

Infoscience & OAI infoscience.epfl.ch

Freely accessible PhD theses are declared in OAIster

Already tried to declare Infoscience in Scirus, Google scholar… Not as simple as it should be

Next : ISI Web citation index http://scientific.thomson.com/news/newsletter/2005-02/8264025/

Regarding OA, EPFL attitude is « – Advocacy OAI will be done in 2005 awaited and could help moderate » – Variable from one lab. to another, Open access still frightens – Official statement from Conférence des Universités Suisses ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005

PhDTheses@epfl infoscience.epfl.ch

1920  Paper archiving at the Central library. 3000 PhD theses

2003  Electronic archiving made possible

June 2004  Electronic archiving compulsory . 200 PhD theses /year

Retrodigitalization of all PhD theses, started end 2004 600 000 pages.

300 dpi, B&W or 150 dpi, grey levels for color pages TIFF provided, PDF 1.4 image online OCR for liminary pages of 2000-2004 theses Total amount of data 15 Gb ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005

Workflow , PhDTheses@epfl PhD Student THESE File FINAL VERSION PDF or Postscript Academic registration service EPFL Print version N copies Printing service EPFL PDF file Libraries : National, ETHZ Swiss National Library ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005 Central Library EPFL Metadat a File processing, Putting online infoscience.epfl.ch

Information: Final version released Asking for Authorisation to put PhD these on the Internet, if NOT, then on Intranet

Metadata processing, PhDTheses@epfl 2 Catalogues (unfortunately ) Cataloguing in NEBIS copy and paste Data enrichment Filemaker Web database Abstract Fulltext infoscience.epfl.ch

OAI-PMH INFOSCIENCE CDSWare OAI enabled Loading XML records With abstracts Abstract DTD RERO Link to Filemaker web record ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005

PDF files processing, PhDTheses@epfl infoscience.epfl.ch

Made by the central library for each these Creation of final • Frontpage (in PDF) • Abstract (in PDF + HTML) • TOC (in PDF) Optimization of heavy PDF files Security • Modification not allowed • Printing, searching , copy allowed File metadata • Size of file, links to abstracts and fulltext, • Number of pages ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005

The future is now, PhDTheses@epfl •

Main issues

– Intellectual Property Rights ( IPR ) – Metadata and file formats – Harvesting and visibility – Master dissertations infoscience.epfl.ch

ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005

The future is now, PhDTheses@epfl •

IPR

– Belongs to the Author – Electronic archiving made compulsory upon student registration – Changes in swiss law in 2005 infoscience.epfl.ch

Putting theses on the Internet Is not considered as prior publication

ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005

The future is now, PhDTheses@epfl •

Metadata and file format – Metadata

• Swiss « DTD » ? (Convention in 2003) • DTD-MS from NDLTD ? (already exists) • TEF from AFNOR ? (awaited for 2005)

– PDF

• PDF/A ? (ISO Standard in 2005?)

Standards could appear in 2005

infoscience.epfl.ch

ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005

The future is now, PhDTheses@epfl infoscience.epfl.ch

Harvesting and visibility

– OAIster • RERO, EPFL, ETHZ already included • Good Search interface • Not specific to theses – Should we join India NDLTD ?

• 195 members : UK, B, S, D, E, CN, AU, China, • Very poor Search interface, for the moment – European Thesis On Line (ETOL, Europe) • Just at its begining

Militate in favour of Switzerland member of NDLTD

ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005

If needed, more info about: infoscience.epfl.ch

ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005

If needed, more info about: infoscience.epfl.ch

ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005

If needed, more info about: infoscience.epfl.ch

ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005

The future is now, PhDTheses@epfl infoscience.epfl.ch

Master dissertations

– The next step – Will require along and difficult institutional agreement to bep fully set up – In each of the 12 academic sections of EPFL collaboration with librarians, who are in contact with the teachers

The Infoscience tool is robust And allows us to work with voluntary people !

ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005

infoscience.epfl.ch

ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005

Long live OAI !

Thank you for your attention

http://infoscience.epfl.ch

[email protected]