Transcript Infoscience
infoscience.epfl.ch
1.
2.
3.
4.
5.
6.
I
nfoscience
EPFL’s Institutional Repository … and much more
Objectives Means Content & Services Infoscience vs OAI PhDTheses@epfl Next steps forward David Aymonin Directeur de l’Information Scientifique et des Bibliothèques ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005
ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005 infoscience.epfl.ch
Objectives infoscience.epfl.ch
•
Collect and make known the EPFL intellectual heritage , i.e. its scientific and teaching output
•
Make researchers and their skills more visible
•
Make the scientific data collected more accessible and legible ,by structuring them
•
Allow their long term preservation
•
Allow their processing the institution for the assessment needs of ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005
Means infoscience.epfl.ch
•
Human resources
– Full-time project leader , new position – Ad-hoc team : EPFL staff, when needed •
Technical choice
– Based on CDSWare , XMLMARC , Python language – Official partnership with CERN software development for CDSWare ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005
Content, 1st june 2005 Searches : Exports : Fulltexts : 793/day, ± 24 000/month 188/day, ± 114/day, ± 5 600/month 3 500/month infoscience.epfl.ch
Infoscience Scientific outputs Theses People@epfl Union catalogue 4 799 references 1 500 fulltext 26 laboratories 3 274 references 733 fulltext 755 profiles of researchers 40 000 references.
11 libraries * * : 400 000 references from 11 EPFL libraries, members of NEBIS Union catalogue will be added on october 2005 ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005
Services : Data export infoscience.epfl.ch
ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005
Services : Data re-use infoscience.epfl.ch
Old version. LANOS Lab. Website
Local database ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005
Services : Data re-use infoscience.epfl.ch
New version. LANOS Lab. Website
infoscience ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005
Services : People@epfl infoscience.epfl.ch
Old version. LANOS Lab. Directory
ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005
Services : People@epfl infoscience.epfl.ch
New version. LANOS Lab. Directory
ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005
Infoscience & OAI infoscience.epfl.ch
•
Freely accessible PhD theses are declared in OAIster
•
Already tried to declare Infoscience in Scirus, Google scholar… Not as simple as it should be
•
Next : ISI Web citation index http://scientific.thomson.com/news/newsletter/2005-02/8264025/
•
Regarding OA, EPFL attitude is « – Advocacy OAI will be done in 2005 awaited and could help moderate » – Variable from one lab. to another, Open access still frightens – Official statement from Conférence des Universités Suisses ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005
PhDTheses@epfl infoscience.epfl.ch
•
1920 Paper archiving at the Central library. 3000 PhD theses
•
2003 Electronic archiving made possible
•
June 2004 Electronic archiving compulsory . 200 PhD theses /year
•
Retrodigitalization of all PhD theses, started end 2004 600 000 pages.
300 dpi, B&W or 150 dpi, grey levels for color pages TIFF provided, PDF 1.4 image online OCR for liminary pages of 2000-2004 theses Total amount of data 15 Gb ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005
Workflow , PhDTheses@epfl PhD Student THESE File FINAL VERSION PDF or Postscript Academic registration service EPFL Print version N copies Printing service EPFL PDF file Libraries : National, ETHZ Swiss National Library ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005 Central Library EPFL Metadat a File processing, Putting online infoscience.epfl.ch
Information: Final version released Asking for Authorisation to put PhD these on the Internet, if NOT, then on Intranet
Metadata processing, PhDTheses@epfl 2 Catalogues (unfortunately ) Cataloguing in NEBIS copy and paste Data enrichment Filemaker Web database Abstract Fulltext infoscience.epfl.ch
OAI-PMH INFOSCIENCE CDSWare OAI enabled Loading XML records With abstracts Abstract DTD RERO Link to Filemaker web record ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005
PDF files processing, PhDTheses@epfl infoscience.epfl.ch
Made by the central library for each these Creation of final • Frontpage (in PDF) • Abstract (in PDF + HTML) • TOC (in PDF) Optimization of heavy PDF files Security • Modification not allowed • Printing, searching , copy allowed File metadata • Size of file, links to abstracts and fulltext, • Number of pages ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005
The future is now, PhDTheses@epfl •
Main issues
– Intellectual Property Rights ( IPR ) – Metadata and file formats – Harvesting and visibility – Master dissertations infoscience.epfl.ch
ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005
The future is now, PhDTheses@epfl •
IPR
– Belongs to the Author – Electronic archiving made compulsory upon student registration – Changes in swiss law in 2005 infoscience.epfl.ch
Putting theses on the Internet Is not considered as prior publication
ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005
The future is now, PhDTheses@epfl •
Metadata and file format – Metadata
• Swiss « DTD » ? (Convention in 2003) • DTD-MS from NDLTD ? (already exists) • TEF from AFNOR ? (awaited for 2005)
• PDF/A ? (ISO Standard in 2005?)
Standards could appear in 2005
infoscience.epfl.ch
ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005
The future is now, PhDTheses@epfl infoscience.epfl.ch
•
Harvesting and visibility
– OAIster • RERO, EPFL, ETHZ already included • Good Search interface • Not specific to theses – Should we join India NDLTD ?
• 195 members : UK, B, S, D, E, CN, AU, China, • Very poor Search interface, for the moment – European Thesis On Line (ETOL, Europe) • Just at its begining
Militate in favour of Switzerland member of NDLTD
ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005
If needed, more info about: infoscience.epfl.ch
ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005
If needed, more info about: infoscience.epfl.ch
ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005
If needed, more info about: infoscience.epfl.ch
ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005
The future is now, PhDTheses@epfl infoscience.epfl.ch
•
Master dissertations
– The next step – Will require along and difficult institutional agreement to bep fully set up – In each of the 12 academic sections of EPFL collaboration with librarians, who are in contact with the teachers
The Infoscience tool is robust And allows us to work with voluntary people !
ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005
infoscience.epfl.ch
ELAG 2005. CERN, Geneva, Mercredi 1 er juin 2005