Slajd 1 - PL-Grid
Download
Report
Transcript Slajd 1 - PL-Grid
Polish Infrastructure
for Supporting Computational Science
in the European Research Space
The Capabilities of the GridSpace2
Experiment Workbench
Jan Meizner
and Distributed Computing Environments (DICE)
Team
Academic Computer Centre CYFRONET
i3: Internet - Infrastruktury - Innowacje
1-3.12.2010
EUROPEAN UNION
Motivation
Complex scientific applications on
modern computing infrastructures
Clusters, Grids, Clouds
Diverse software packages
Applications (Gaussian, NAMD,…)
Web Services
Scripts: Perl, Python, Ruby
Different users
Chemists, biologists
Programmers
End users
Various data types
Files, databases, URLs
Exploratory programming
Unstructured, dynamic, prototyping
Collaboration
Teams, communities
GridSpace2 Objectives
Facilitate dealing with application throughout its entire
lifecycle (development, deployment, sharing, operation,
maintenance) from single “workbench” where all
available software is integrated
Reflect and support a natural daily style of work with a
suite of software – workflows, (not formalized)
procedures, task paths etc.
Addresses a specific type of application called
experiments
GridSpace2 Features
Platform – as opposed to concrete application
General-purpose
Exploits Web 2.0 opportunities in facilitating
application development, operation, provisioning
GridSpace2 Experiment
Experiment - a process that combines a
sequence of activities (usage of programs,
services) that act on input data in order to
produce experiment results
Experiment plan – a specification of the sequence
of activities
Experiment run – an enactment of the experiment
plan on particular input data, producing particular
results
Complex workflow going beyond manual simple
and repeatable execution of single programs
Exploratory programming
Unstructured, dynamic, prototyping, further
activities not known a priori
GridSpace2 Experiment Plan
Combines steps realized on a range of software
environments, platforms, tools, languages etc
Developed, shared and reused collaboratively
amongst ad-hoc researching teams
Composed of collaboratively owned libraries and
services used (called gems) and experiment
parts (called snippets)
Exploratory Programming
Involves experimentation and
exploring – step by step programming
where steps are likely not known in
advance but rather provided ad-hoc
basing on the results of previous ones
Experiment needs to be re-enacted
many times with some ad-hoc
customization made dynamically
while the workflow enactment has
already started
Cannot be fully automated and needs
continuous supervision, validation or even intrusion
Dynamic nature of experiment plan – certain decisions taken at
runtime (e.g. code provided from input data)
But: Despite its indirect development process experiment still needs
to be traceable, verifiable, easily re-runnable and its outcome –
straightforwardly reproducible,
Security in GridSpace2
Connectivity via:
HTTPS (browser <-> Experiment Workbench)
SSH/SCP/SFTP (ExperimentWorkbench <-> Experiment Host)
User account context on Experiment Host
OS-level accessibility rights to files
Snippet code can contain a „secret” literal introduced by meta-markup
<SECRET:MY_SECRET_NAME>
During the execution this meta-markup is replaced with secret value
taken from personal secret database called Wallet
Available Wallet implementations:
Simple file database located on the Experiment Host
Remote Central Wallet (ReCeW)
ReCeW (Remote Central Wallet) – key features:
Security – HTTPS protected REST API and AES-256 encryption
of stored credentials
Highly efficient implementation as native (C++) application
Extendable through plug-in mechanism (4 types of plug-ins)
Working with GridSpace2
Easy access using Web
browser
Experiment Workbench
Constructing experiment
plans from code snippets
Interactively run
experiments
Experiment Execution
Environment
Multiple interpreters
Access to libraries,
programs and services
(gems)
Access to computing
infrastructure
Cluster, grid, cloud
9
Application:
Analysis of water solutions of aminoacids
Involving multiple steps realized with many
tools, langauges and libraries used for
Packmol – molecular dynamics simulations
of packing molecules in a defined regions
of space
Jmol – visualization of solution
Gaussian – computing a spectrum of the
solution
Python/CCLIB – extracting spectrum info
jqPlot – displaying plot
Collaboration with computational chemists of ACC Cyfronet AGH and
Departament of Chemistry, Jagiellonian University, Dr. Mariusz Sterzel,
Klemens Noga
Conclusions
Complex scientific applications need dedicated tools and
approaches.
In-silico experiments are supported by Virtual Laboratory
powered by GridSpace2 technology.
Applications:
Bioinformatics
Computational chemistry
More are welcome!
Virtual laboratory is open for PL-Grid users.
11
References
http://wl.plgrid.pl
– open the Virtual Laboratory in your browser
http://dice.cyfronet.pl/gridspace
– learn more about GridSpace technology
http://dice.cyfronet.pl/
– Distributed Computing Environemnts Team (DICE)
website
12