Information architectures theory and practice (Internet, Web, Grid, Cloud) and class project definitions Peter Fox Xinformatics – ITEC/CSCI/ERTH-4400/6400 Week 8, March 26, 2013
Download ReportTranscript Information architectures theory and practice (Internet, Web, Grid, Cloud) and class project definitions Peter Fox Xinformatics – ITEC/CSCI/ERTH-4400/6400 Week 8, March 26, 2013
Information architectures theory and practice (Internet, Web, Grid, Cloud) and class project definitions Peter Fox Xinformatics – ITEC/CSCI/ERTH-4400/6400 Week 8, March 26, 2013 1 Contents • Review of last class, reading • Information architectures theory and practice (Reference, Internet, Web, Grid, Cloud) and class project definitions • Project assignment • Next classes 2 Data-Information-Knowledge Ecosystem Producers Consumers Experience Data Creation Gathering Information Presentation Organization Knowledge Integration Conversation Context 3 (Information) Architecture • Definition: – “is the art of expressing a model or concept of information used in activities that require explicit details of complex systems” (wikipedia) – “… I mean architect as in the creating of systemic, structural, and orderly principles to make something work - the thoughtful making of either artifact, or idea, or policy that informs because it is clear.” Wuman 4 Is this a good example? 5 More detail to connect us • “The term information architecture describes a specialized skill set which relates to the interpretation of information and expression of distinctions between signs and systems of signs.” (wikipedia, emphasis added) 6 Meaning not deep thought • “Information architecture is the categorization of information into a coherent structure, preferably one that the most people can understand quickly, if not inherently. • It's usually hierarchical, but can have other structures, such as concentric or even chaotic.” (wikipedia) 7 An example – learning portal 8 And relation to design? • “In the context of information systems design, information architecture refers to the analysis and design of the data stored by information systems, concentrating on entities, their attributes, and their interrelationships. • It refers to the modeling of data for an individual database and to the corporate data models an enterprise uses to coordinate the definition of data in several (perhaps scores or hundreds) of distinct databases. • The "canonical data model" is applied to integration technologies as a definition for specific data passed between the systems of an enterprise. • At a higher level of abstraction it may also refer to the definition of data stores.” (wikipedia) 9 Art or skill? • Form follows function (Sullivan) – who put this into effect in building structures, homes? • Based on two previous foundations classes, information theory and signs, it should be clear that the answer is ‘yes’ (both). 10 Design theory • Elements – Form – Value – Texture – Lines – Shapes – Direction – Size – Color • Relate these to previous class, signs and relations between them 11 Examples 12 Remember this one? 13 Principles of design • Balance – Balance in design is similar to balance in physics • Gradation – of size and direction produce linear perspective. – of color from warm to cool and tone from dark to light produce aerial perspective. – can add interest and movement to a shape. – from dark to light will cause the eye to move along a shape. • Repetition – with variation is interesting, without variation repetition can become monotonous. 14 Balance, gradation, repetition 15 Principles of design • Contrast – is the juxtaposition of opposing elements e.g. opposite colors on the color wheel - red / green, blue / orange etc. – in tone or value - light / dark. – in direction - horizontal / vertical. – The major contrast in a painting should be located at the center of interest. – Too much contrast scattered throughout a painting can destroy unity and make a work difficult to look at. – Unless a feeling of chaos and confusion are what you are seeking, it is a good idea to carefully consider where to place your areas of maximum contrast. 16 Contrast 17 Principles of design • Harmony – in painting is the visually satisfying effect of combining similar, related elements. e.g. adjacent colors on the color wheel, similar shapes etc. • Dominance – gives a scene interest, counteracting confusion and monotony – can be applied to one or more of the elements to give emphasis 18 Harmony, Dominance 19 Principles of design • Unity – Relating the design elements to the idea being expressed in a rendering reinforces the principal of unity. – E.g. a scene with an active aggressive subject would work better with a dominant oblique direction, course, rough texture, angular lines etc. whereas a quiet passive subject would benefit from horizontal lines, soft texture and less tonal contrast. – in a painting also refers to the visual linking of 20 various elements of the work. Unity 21 Color • Primary Colors - Red, Yellow, Blue - these colors should not be intermingled, they must be bought together in some other form • Secondary Color - Orange, Violet, Green, these colors are created by mixing two primaries. • Intermediate Colors - Red Orange, Yellow Green, Blue Violet, etc.; mixing a primary with a secondary creates these colors. • Complementary Colors - are colors that are opposite each other on the color wheel. When placed next to each other they look bright and when mixed together they neutralize each other. 22 Wheels 23 Color applied • Harmony is when an artist uses certain combinations of colors that create different looks or feelings • Analogous Colors are colors that are next to each other on the color wheel for example red, red orange, and orange are analogous colors. • Triadic Harmony is where three equally spaced colors on the color wheel are used for example, Yellow, Red, Blue is a triadic harmony color scheme. • Monochromatic is where one color is used but in different values and intensity. 24 Color applied • Warm colors are on one side of the color wheel and they give the felling of warmth for example red, orange and yellow are the color of fire and feel warm. • Cool colors are on the other side of the color wheel and they give the feeling of coolness for example blue, violet, are the color of water, and green are the color of cool grass. 25 Reference architectures • “provides a proven template solution for an architecture for a particular domain. It also provides a common vocabulary with which to discuss implementations, often with the aim to stress commonality. • A reference architecture often consists of a list of functions and some indication of their interfaces (or APIs) and interactions with each other and with functions located outside of the scope of the reference architecture.” (wikipedia) 26 U.S. Federal Enterprise Arch • E.g. The Federal Enterprise Architecture Reference Model Ontology (FEA-RMO) is a domain specific ontology of the Federal Enterprise Architecture reference models. • FEA-RMO directly translates the Performance, Business, Service Component, and Technical reference models into their executable representation in OWL-DL. – http://web-services.gov/fea-rmo.html 27 FEA Domain model 28 Figure 6-1 DRM Abstract Model 29 Data Description 30 Data Sharing 31 Data Context 32 MVC • Model • View • Controller 33 %&' ( )*$2,$/"3 45$- *. $4667&180&9: $4)1; &0*10( )*$ Internet/ Intranet • Communications versus information architecture? • http://www.slideshare.net/postwait/scalableinternet-architecture • See the reading for this week, RFC1958 and the role of the Internet Engineering Task Force (IETF) and the Request for Comment Process 34 A<C0, 2?$+=+F, J +:$. , $@<G, $1J 6K, J , 2F, ?$+, G, -<K$/ F@, -$+=+F, J $C/ J 6/ 2, 2F+$12$<??1F1/ 2$F/ $6-/ G1?, $F/ $ , 2@<2C, $F@, $+=+F, J $C<6<A1K1F1, +b$I/ -$, O<J 6K, >$C<C@123$+, -G, -+$I/ -$1J 6-/ G123$F@, $-, +6/ 2+, $F1J , $I/ -$<KK$F@, $ H, A$6<3, +$12CK5?123$J <6$3<KK, -=$<2?$F@, $12F, -<CF1G, $G1+5<K1V<F1/ 2$F/ / K:$( $C/ J 6K, F, $+=+F, J $<-C@1F, CF5-, $ +@/ H123$F@, $G<-1/ 5+$C/ J 6/ 2, 2F+$<2?$?<F<$IK/ H$1+$1KK5+F-<F, ?$12$7135-, $W:$ E.g. 35 $ %&'( )*$<,$/=>0*? $4)1; &0*10( )*$/( 669)0&: ' $- *. >&0*$3 *@*796? *: 0$ WWW • Design for the web (Tim Berners Lee) • “Principles such as simplicity and modularity are the stuff of software engineering; decentralization and tolerance are the life and breath of Internet. To these we might add the principles of least powerful language, and the test of independent invention when considering evolvable Web technology.” 36 Original design issues • See http://www.w3.org/DesignIssues/Overview.html • Here are the criteria and features to be considered: – Intended uses of the system. – Availability on which platforms? – Navigational techniques and tools: browsing, indexing, maps, resource discovery, etc – Keeping track of previous versions of nodes and their relationships – Multiuser access: protection, editing and locking, annotation. – Notifying readers of new material available – The topology of the web of links – The types of links which can express different relationships between nodes 37 Original design issues • These are the three important issues which require agreement between systems which can work together – Naming and Addressing of documents – Protocols – The format in which node content is stored and transferred • Implementation and optimization – Caching , smart browsers, knowbots etc., format conversion, gateways 38 Web architectural elements URI HTML HTTP 39 Common Gateway Interface 40 Client – Server and multi tier 41 Web page/site architecture • Hierarchies, we call them levels: – Top level (the main page) – Second (and further) level (via navigation) – Balancing the levels • Remember your use case, the actors, the resources, the information model, information entropy, the signs, ... 42 CEDAR 1.0 circa 1990 43 CEDAR 2.0 circa 1994 44 2000 45 46 Multi-tiered Interoperability People Agency Policy Makers System Scientists Politicians Decision-level semantic mediation: high-level vocabularies that facilitate policy-level decision-making Integrated Applications Inter-disciplinary Data Visualization Apps Semantic interoperability Integration Frameworks & Methodologies Eco & other system Assessment Apps Application-level semantic mediation: mid-level vocabularies that facilitate the interoperability of system models and data products Sof t ware, Tools&Apps Disciplinespecific model(s) Semantic interoperability Dataproduct Generator Semantic query, hypothsis and inference Information/ Science Apps Query, access and use of data Data-level Semantic mediation: lower-level vocabularies applied to each data source for a specific science domain of interest Data Repositories Federal Repository Commercial Database Researcher Private Database Other Data Sources Metadata, schema, data ... ... ... Grid • “One of the main strategies of Grid computing is to use middleware to divide and apportion pieces of a program among several computers, sometimes up to many thousands. • Grid computing involves computation in a distributed fashion, which may also involve the aggregation of large-scale cluster computing based systems.” (wikipedia) 48 “What is the Grid?” • A Three Point Checklist, Ian Foster lists these primary attributes: – Computing resources are not administered centrally – Open standards are used. – Nontrivial quality of service is achieved. 49 Open Grid Services Architecture 50 Statefull versus stateless • A key distinction between Grids and Web environments is state, i.e. the knowledge of ‘who’ knows and remembers ‘what’ • Increasingly there is a need for maintaining some form of state, i.e. reducing information entropy in web and internet-based architectures • Thus, enter the need for ‘state for a defined purpose’… 51 Cloud • "a computing capability that provides an abstraction between the computing resource and its underlying technical architecture (e.g., servers, storage, networks), enabling convenient, on-demand network access to a shared pool of configurable computing resources that can be rapidly provisioned and released with minimal management effort or service provider interaction.” (wikipedia) • Logical extension of virtualization 52 • Often tied to the cost model Primary Benefits of Cloud Computing • To deliver a future state architecture that captures the promise of Cloud Computing, architects need to understand the primary benefits of Cloud computing • Decoupling and separation of the business service from the infrastructure needed to run it (virtualization) • Flexibility to choose multiple vendors that provide reliable and scalable business services, development environments, and infrastructure that can be leveraged out of the box and billed on a metered basis—with no long term contracts • Elastic nature of the infrastructure to rapidly allocate and de-allocate massively scalable resources to business services on a demand basis • Cost allocation flexibility for customers wanting to move CapEx into OpEx • Reduced costs due to operational efficiencies, and more rapid deployment of new business services 53 Software as a service (SaaS) • A SaaS provider typically hosts and manages a given application in their own data center and makes it available to multiple tenants and users over the Web. • Some SaaS providers run on another cloud provider’s PaaS or IaaS service offerings. • Oracle CRM On Demand, Salesforce.com, and Netsuite are some of the well known SaaS 54 Infrastructure as a service (IaaS) • is the delivery of hardware (server, storage and network), and associated software (operating systems virtualization technology, file system), as a service. It is an evolution of traditional hosting that does not require any long term commitment and allows users to provision resources on demand. • Unlike PaaS services, the IaaS provider does very little management other than keep the data center operational and users must deploy and manage the software services themselves--just the way they would in their own data center. Amazon Web Services Elastic Compute Cloud (EC2) and Secure Storage Service (S3) are examples of IaaS offerings. 55 Platform as a service (Paas) • is an application development and deployment platform delivered as a service to developers over the Web. • facilitates development and deployment of applications without the cost and complexity of buying and managing the underlying infrastructure, providing all of the facilities required to support the complete life cycle of building and delivering web applications and services entirely available from the Internet. • consists of infrastructure software, and typically includes a database, middleware and development tools. • A virtualized and clustered grid computing architecture is often the basis for this infrastructure software. 56 Platform as a service (Paas) • Some PaaS offerings have a specific programming language or API. • For example, Google AppEngine is a PaaS offering where developers write in Python or Java. • EngineYard is Ruby on Rails. • Sometimes PaaS providers have proprietary languages like force.com from Salesforce.com and Coghead, now owned by SAP 57 Simple cloud architectures 58 More complex clouds 59 More details… 60 Cloud domain decompostion By functional domain 61 Towards a reference architecture? 62 Discussion • About architecture in general? • Design? • Internet, web, grid, cloud? 63 Project Assignment • A) Analysis of existing information system content and architecture, critique, redesign and prototype redeployment • B) Pursuit of a detailed use case around a particular area of informatics, includes developing a prototype IS, architecture, design, etc. • Due April 30 (write up) and May 7 (presentation) • That’s 5 (6) weeks • Check in on progress in ~ 2 weeks • Will set aside class time to meet 64 Teams (*tentative*) • IR: Tom, Ameya, Liuxun Zhang, Konstantin, Ledong. • Red: Steven, Tao Chen, Cynthia, Aaron, Rahul • Orange: Brendan, Andrew, Tommy, Aishwarya • Yellow: Daniel, Aaditya, Waqas, Simon • Green: Alexander, Tyson, Di Wu, Jianing Zhang • Blue: Jerry, Chengcong, Tejaswini, Matt F, TJ. • Indigo: Justin, Xiaoyu Lei, Anusha, Chen • Violet: Katie, Jianguo Zhong, Jeremy, John 65 Let’s look at these: • http://www.nws.noaa.gov/ • http://www.nodc.noaa.gov/, http://www.ngdc.noaa.gov/, http://www.ncdc.noaa.gov/ • http://www.bco-dmo.org/ • http://giovanni.gsfc.nasa.gov • http://mirador.gsfc.nasa.gov • 50 best web sites of 2012 • http://www.coolhomepages.com/ • Worst web sites… 66 What is next • Reading for this week – Design, color – Architectures – Life cycle • Week 9 – Information Integration, Life-cycle and Visualization • Week 10 – Information Management, Workflow, and Discovery and project check in/ discussion 67