Transcript Essential Software Architecture
Essential Software Architecture
Session 1: Introduction to Software Architecture 1
Session Outline
IT Systems Application Domain What is Software Architecture?
Definitions What does an architect do?
Architectures and technologies Further reading 2
IT Systems Application Domain
IT systems are everywhere Banks Shops Internet sites Large, complex, heterogeneous, distributed applications Use commercial-of-the-shelf middleware, databases, web servers, application packages Major problems are architecture design, technology selection, application and business integration 3
What is Software Architecture?
It’s about software design All architecture is software design, but not all design is software architecture Part of the design process Simply, architecture focuses on ‘issues that will be difficult/impossible to change once the system is built’ Quality attributes like security, performance Non-functional requirements like cost, deployment hardware More on these later in this session 4
Defintions - ANSI/IEEE Std 1471-2000
“Architecture is the fundamental organization of a system, embodied in its components, their relationships to each other and the environment, and the principles governing its design and
evolution
.”
5
Definitions - SEI
“The software architecture of a program or computing system is the structure or structures of the system, which comprise software elements, the externally visible properties of those elements, and the relationships among them.
” 6
Definitions – Garlan and Shaw
“[Software architecture goes] beyond the algorithms and data structures of the computation; designing and specifying the overall system structure emerges as a new kind of problem. Structural issues include gross organization and global control structure; protocols for communication, synchronization, and data access; assignment of functionality to design elements; physical distribution; composition of design elements; scaling and performance; and selection among
design alternatives.”
7
Architecture Defines Structure
Decomposition of system in to components/modules/subsystems Architecture defines: Component interfaces What a component can do Component communications and dependencies How components communicate Component responsibilities Precisely what a component will do when you ask it 8
Structure and Dependencies
Excessive component dependencies are bad!
Key architecture issue Identifying components that may change Reduce direct dependencies on these components Creates more modifiable systems C1 C2 C3 C4 Third Party Component C1 C2 C3 C4 AL Third Party Component Four components are directly dependent on a third party component. If the third party component is replaced with a new component with a different interface, changes to each component are likely.
C
Diagram Key
Component Only the AL (abstraction layer) component is directly dependent on the third party component. If the third party component is replaced, changes are restricted to the AL component only Dependency 9
Architecture Specifies Component Communication
Communication involves: Data passing mechanisms, e.g.: Function call Remote method invocation Asynchronous message Control flow Flow of messages between components to achieve required functionality Sequential Concurrent/parallel Synchronization 10
Architecture Patterns/Styles
Patterns catalogue successfully used structures that facilitate certain kinds of component communication client-server Message broker Pipeline Patterns have well-known characteristics appropriate for particular types of requirements Patterns are very useful things … Reusable architectural blueprints Help efficiently communicate a design Large systems comprise a number of individual patterns “Patterns and Styles are the same thing – the patterns people won” [anonymous SEI member] 11
Architecture addresses NFRs
Non-functional requirements (NFRs) define ‘how’ a system works NFRs rarely captured in functional requirements Aka. architecture requirements Must be elicited by architect NFRs include: Technical constraints Business constraints Quality attributes 12
Architecture is an Abstraction
Architecture provides an abstract view of a design Hides complexity of design May or may not be a direct mapping between architecture elements and software elements Example – A Marketecture informal depiction of system’s structure and interactions. portray the design philosophies embodied in the architecture Every system should have a marketecture: Easy to understand Helps discussion during design, build, review, sales (!) process 13
Decomposition
Hierarchical decomposition is a powerful abstraction mechanism Partitions design Allocate components to development teams Why isn’t the
Client
component decomposed?
Top Level Architecture Description Client Broker Server Messag e Handler Director y Server Security Server Request Handler Data Store Diagram Key
C Component Dependency 14
Architecture Views
A software architecture represents a complex design artifact Many possible ‘views’ of the architecture Cf. with buildings – floor plan, external, electrical, plumbing, air-conditioning 15
Philippe Krutchen - 4+1 View Model
Logical view:
describes architecturally significant elements of the architecture and the relationships between them.
Process view:
describes the concurrency and communications elements of an architecture.
Physical view:
depicts how the major processes and components are mapped on to the applications hardware.
Development view:
software components as held in e.g. a configuration management tool. captures the internal organization of the
Architecture use cases:
capture the requirements for the architecture; related to more than one particular view 16
SEI – Views and Beyond
Module:
structural view of the architecture, comprising the code modules such as classes, packages and subsystems
Component and Connector:
describes the behavioral aspects of the architecture. Components are objects, threads or processes, and connectors describe how components interact.
Allocation:
shows how the processes are mapped to hardware and gives a view of the source code in the configuration management systems.
This course will focus on
structural
views and
behavioral
17
What does an Architect do?
Many responsibilities: Liaison with stakeholders Technology knowledge Software engineering Risk managements Long list at: http://www.sei.cmu.edu/ata/arch_duties.html 18
Architectures and Technologies
Architects reduce risks by using proven design patterns Must map abstract design pattern to concrete implementation Software vendors have created (COTS) technologies that explicitly support widely used patterns Makes implementation of patterns easier Reduces risk if technology is well built 19
Architectures and Technologies
Abstract
Architectural Patterns/Styles Application Servers Messaging Message Brokers Object Brokers
Concrete COTS technologies
Process Orchestration
Each technology has multiple vendors/open source versions Architects need to choose technology wisely Proof of concept prototype Detailed feature evaluation 20
Summary
“
The life of a software architect is a long (and sometimes painful) succession of sub-optimal decisions made partly in the dark”
Architecture involves complex design decisions Architect role is much more than just technical design Made harder by early lifecycle nature of much of the design 21
Selected Further Reading
L. Bass, P. Clements, R Kazman. Software Architecture in Practice, Second Edition. Addison Wesley, 2003.
R. Wirfs-Brock, A. McKean. Object Design: Roles, Responsibilities, and Collaborations. Addison Wesley, 2002.
M. Fowler. Patterns of Enterprise Application Architecture. Addison-Wesley, 2002.
I.Gorton, A Liu.
Performance Evaluation of Alternative Component Architectures for Enterprise JavaBean Applications,
in IEEE Internet Computing, vol.7, no. 3, pages 18-23, 2003.
22
Essential Software Architecture
Session 2: Introduction to the Case Study 23
ICDE System
Information Capture and Dissemination Environment (ICDE) is a software system for providing intelligent assistance to financial analysts scientific researchers intelligence analysts analysts in other domains 24
ICDE Schematic
Internet Local information repositories ICDE Repository Analyst ICDE Recording Software 3 rd Party Tools 25
ICDE Use Cases
Analyst * *
ICDE
* Capture User Actions * * * 3rd Party Tools * User Assistance * * Query User Actions * * * Data Store 26
Case Study Context
ICDE version 1.0 in production Basically a complex,
raw
information capture tool, GUI for looking at captured data 2 tier client-server, single machine deployment Java, Perl, SQL, Programmatic access to data through very complex SQL (38 tables, 46 views) 27
ICDE version 2.0
ICDE v2.0 scheduled for development in 12 month timeframe Fixed schedule, budget Major changes to: Enhance data capture tools (GUI) Support 3 rd party tool integration, testing, data access and large production scale deployments (100’s of users) Very few concrete requirements for the 3 rd party tool support or release to full production environment 28
ICDE v2.0 Business Goals
Business Goal
Encourage third developers party tool
Supporting Technical Objective
Simple and reliable programmatic access to data store for third party tools Heterogeneous (i.e.
non-Windows) support for running third party tools platform Allow third party tools to communicate with ICDE users from a remote machine Promote the ICDE concept to users Scale the data collection and data store components to support up to 150 users at a single site Low-cost deployment for each ICDE user workstation 29
Architecturally Significant Requirements for ICDE v2.0
ICDE project requirements: Heterogeneous platform support for access to ICDE data Instantaneous event notification (local/distributed) Over the Internet, secure ICDE data access Ease of programmatic data access ICDE Project team requirements: Insulate 3 rd evolution party projects and ICDE tools from database Reliability for multi-tool ICDE deployments Scalable infrastructure to support large, shared deployments Minimize license costs for a deployment Unknowns Minimize dependencies, making unanticipated changes potentially easier 30
Summary
ICDE is a reasonably complex system Will be used to illustrate concepts during the remainder of this course 31
Essential Software Architecture
Session 3: Quality Attributes 32
What are Quality Attributes
Often know as –ilities Reliability Availability Portability Scalability Performance (!) Part of a system’s NFRs “how” the system achieves its functional requirements 33
Quality Attribute Specification
Architects are often told: “My application must be fast/secure/scale” Far too imprecise to be any use at all Quality attributes (QAs) must be made precise/measurable for a given system design, e.g.
“It must be possible to scale the deployment from an initial 100 geographically dispersed user desktops to 10,000 without an increase in effort/cost for installation and configuration.”
34
Quality Attribute Specification
QA’s must be concrete But what about testable?
Test scalability by installing system on 10K desktops?
Often careful analysis of a proposed solution is all that is possible “It’s all talk until the code runs” 35
Performance
Many examples of poor performance in enterprise applications Performance requires a: Metric of amount of work performed in unit time Deadline that must be met Enterprise applications often have strict performance requirements, e.g.
1000 transactions per second 3 second average latency for a request 36
Performance - Throughput
Measure of the amount of work an application must perform in unit time Transactions per second Messages per minute Is required throughput: Average?
Peak?
Many system have low average but high peak throughput requirements 37
Throughput Example
CPU % MST (msp) 300 250 200 150 100 50 0 0 5 10 # of threads 15 20 Throughput of a message queuing system Messages per second (msp) Maximum sustainable throughput (MST) Note throughput changes as number of receiving threads increases 38
Performance - Response Time
measure of the latency an application exhibits in processing a request Usually measured in (milli)seconds Often an important metric for users Is required response time: Guaranteed?
Average?
E.g. 95% of responses in sub-4 seconds, and all within 10 seconds 39
Response Time
Example shows response time distribution for a J2EE application 40
Performance - Deadlines
‘something must be completed before some specified time’ Payroll system must complete by 2am so that electronic transfers can be sent to bank Weekly accounting run must complete by 6am Monday so that figures are available to management Deadlines often associated with batch jobs in IT systems.
41
Something to watch for …
What is a Transaction?
Message?
Request?
All are application specific measures.
System must achieve 100 mps throughput BAD!!
System must achieve 100 mps peak throughput for
PaymentReceived
messages GOOD!!!
42
ICDE Performance Issues
Response time: Overheads of trapping user events must be imperceptible to ICDE users Solution for ICDE client: Decouple user event capture from storage using a queue 5. Write event to ICDE database queue 1. Trap user event 2. Write event to queue 3. Return to user thread 4. Read event from queue 43
Scalability
“How well a solution to some problem will work when the size of the problem increases.”
4 common scalability issues in IT systems: Request load Connections Data size Deployments 44
Scalability – Request Load
How does an 100 tps application behave when simultaneous request load grows? E.g.
From 100 to 1000 requests per second?
Ideal solution, without additional hardware capacity: as the load increases, throughput remains constant (i.e. 100 tps), and response time per request increases only linearly (i.e. 10 seconds). 45
Scalability – Add more hardware …
Scale-up: Single application instance is executed on a multiprocessor machine Scale-out: Application replicated on different machines
Application Application CPU Application Application Application 46
Scalability - reality
Adding more hard ware should improve performance: scalability must be achieved without modifications to application architecture Reality as always is different!
Applications will exhibit a decrease in throughput and a subsequent exponential increase in response time. increased load causes increased contention for resources such as CPU, network and memory each request consumes some additional resource (buffer space, locks, and so on) in the application, and eventually these are exhausted 47
Scalability – J2EE example
2500 2000 1500 1000 500 0 0 200 400 600 800
No. of Clients
1000 1200 WAS SB JBoss SB IAS SB SS SB WLS SB BES SB I.Gorton, A Liu,
Performance Evaluation of Alternative Component Architectures for Enterprise JavaBean Applications, in IEEE Internet Computing, vol.7, no. 3, pages 18-23, 2003.
48
Scalability - connections
What happens if number of simultaneous connections to an application increases If each connection consumes a resource?
Exceed maximum number of connections?
ISP example: Each user connection spawned a new process Virtual memory on each server exceeded at 2000 users Needed to support 100Ks of users Tech crash ….
49
Scalability – Data Size
How does an application behave as the data it processes increases in size? Chat application sees average message size double?
Database table size grows from 1 million to 20 million rows?
Image analysis algorithm processes images of 100MB instead of 1MB? Can application/algorithms scale to handle increased data requirements?
50
Scalability - Deployment
How does effort to install/deploy an application increase as installation base grows?
Install new users?
Install new servers?
Solutions typically revolve around automatic download/installation E.g. downloading applications from the Internet 51
Scalability thoughts and ICDE
Scalability often overlooked.
Major cause of application failure Hard to predict Hard to test/validate Reliance on proven designs and technologies is essential For ICDE - application should be capable of handling a peak load of 150 concurrent requests from ICDE clients.
Relatively easy to simulate user load to validate this 52
Modifiability
Modifications to a software system during its lifetime are a fact of life. Modifiable systems are easier to change/evolve Modifiability should be assessed in context of how a system is likely to change No need to facilitate changes that are highly unlikely to occur Over-engineering!
53
Modifiability
Modifiability measures how easy it
may
be to change an application to cater for new (non-) functional requirements.
‘may’
– nearly always impossible to be certain Must estimate cost/effort Modifiability measures are only relevant in the context of a given architectural solution. Components Relationships Responsibilities 54
Modifiability Scenarios
Provide access to the application through firewalls in addition to existing “behind the firewall” access.
Incorporate new features for self-service check-out kiosks.
The COTS speech recognition software vendor goes out of business and we need to replace this component.
The application needs to be ported from Linux to the Microsoft Windows platform.
55
Modifiability Analysis
Impact is rarely easy to quantify The best possible is a: Convincing impact analysis of changes needed A demonstration of how the solution can accommodate the modification without change. Minimizing dependencies increases modifiability Changes isolated to single components likely to be less expensive than those that cause ripple effects across the architecture. 56
Modifiability for ICDE
The range of events trapped and stored by the ICDE client to be expanded. Third party tools to communicate new message types. Change database technology used Change server technology used 57
Security
Difficult, specialized quality attribute: Lots of technology available Requires deep knowledge of approaches and solutions Security is a multi faceted quality … 58
Security
Authentication:
Applications can verify the identity of their users and other applications with which they communicate.
Authorization:
Authenticated users and applications have defined access rights to the resources of the system.
Encryption:
encrypted. The messages sent to/from the application are
Integrity:
This ensures the contents of a message are not altered in transit.
Non-repudiation:
The sender of a message has proof of delivery and the receiver is assured of the sender’s identity. This means neither can subsequently refute their participation in the message exchange.
59
Security Approaches
SSL PKI Web Services security JAAS Operating system security Database security Etc etc 60
ICDE Security Requirements
Authentication of ICDE users and third party ICDE tools to ICDE server Encryption of data to ICDE server from 3 rd party tools/users executing remotely over an insecure network 61
Availability
Key requirement for most IT applications Measured by the proportion of the required time it is useable. E.g.
100% available during business hours No more than 2 hours scheduled downtime per week 24x7x52 (100% availability) Related to an application’s reliability Unreliable applications suffer poor availability 62
Availability
Period of loss of availability determined by: Time to detect failure Time to correct failure Time to restart application Strategies for high availability: Eliminate single points of failure Replication and failover Automatic detection and restart Recoverability (e.g. a database) the capability to reestablish performance levels and recover affected data after an application or system failure 63
Availability for ICDE
Achieve 100% availability during business hours Plenty of scope for downtime for system upgrade, backup and maintenance. Include mechanisms for component replication and failover 64
Integration
ease with which an application can be incorporated into a broader application context Use component in ways that the designer did not originally anticipate Typically achieved by: Programmatic APIs Data integration 65
Integration Strategies
Interoperability through an API facade
API Application Data Third Party Application
Interoperability achieved by direct data access
Data – expose application data for access by other components API – offers services to read/write application data through an abstracted interface Each has strengths and weaknesses … 66
ICDE Integration Needs
Revolve around the need to support third party analysis tools. Well-defined and understood mechanism for third party tools to access data in the ICDE data store. 67
Misc. Quality Attributes
Portability Can an application be easily executed on a different software/hardware platform to the one it has been developed for? Testability How easy or difficult is an application to test? Supportability How easy an application is to support once it is deployed?
68
Design Trade-offs
QAs are rarely orthogonal They interact, affect each other highly secure system may be difficult to integrate highly available application may trade-off lower performance for greater availability high performance application may be tied to a given platform, and hence not be easily portable Architects must create solutions that makes sensible design compromises not possible to fully satisfy all competing requirements Must satisfy all stakeholder needs This is the difficult bit!
69
Summary
QAs are part of an application’s non functional requirements Many QAs Architect must decide which are important for a given application Understand implications for application Understand competing requirements and trade offs 70
Selected Further Reading
L. Chung, B. Nixon, E. Yu, J. Mylopoulos, (Editors). Non-Functional Requirements in Software Engineering Series: The Kluwer International Series in Software Engineering. Vol. 5, Kluwer Academic Publishers. 1999. J. Ramachandran. Designing Security Architecture Solutions. Wiley & Sons, 2002.
I.Gorton, L. Zhu. ACM Press
Tool Support for Just-in-Time Architecture Reconstruction and Evaluation: An Experience Report
. International Conference on Software Engineering (ICSE) 2005, St Loius, USA, 71
Essential Software Architecture
Session 4: A Guide to Middleware Architectures and Technologies 72
Introduction
Middleware is the plumbing or wiring of IT applications Provides applications with fundamental services for distributed computing Insulates applications from underlying platform (OS, DBMS, etc) APIs Lots of middleware exists Different purposes Different vendors Different standards and proprietary technologies 73
Middleware Classification
Business Process Orchestrators Message Brokers Application Servers Transport
BizTalk, TIBCO StaffWare, ActiveBPEL BizTalk, WebSphere Message Broker, SonicMQ J2EE, CCM, .NET
Message-Oriented Middleware, Distributed Objects Systems
74
Outline
CORBA Message-oriented middleware J2EE Message brokers Business process orchestrators 75
CORBA
Venerable distributed object technology Still widely used in telecomms, defense Many different implementations
Client
Object Reference
request
client ORB Server
Servant
reply
server ORB Network
76
CORBA Code Example
module ServerExample { interface MyObject { string isAlive(); }; };
Server CORBA IDL
class MyServant extends _MyObjectImplBase { public String isAlive() { return "\ nLooks like it…\n"; } } ORB orb = ORB.init(args, null); MyServant objRef = new MyServant(); orb.connect(objRef); ORB orb = ORB.init(args, null); // Lookup is a wrapper that actually access the CORBA Naming // Service directory – details omitted for simplicity MyServant servantRef = lookup(“Myservant”)String reply = servantRef.isAlive();
Client
77
CORBA – Some Thoughts
Many associated services, eg Naming Notification Transactions Synchronous technology, client-server relatively tightly coupled Remote calls can/will fail State management in server objects creates ‘interesting’ recovery issues 78
Messaging - MOM
Basic Message Oriented Middleware (MOM) provides features like: Asynchronous communications between processes, applications and systems Send-and-forget Delivering messages despite failures Transactional Messaging Deliver all messages in a transaction, or none Persistence Messages can be logged at the server and hence survive server failure 79
Basic Messaging
Send (queue, message) Put message onto queue Receive (queue, message) Get message from queue No dependency on state of receiving application on message send send queue receive 80
Persistence
send queue receive Receipt of message at queue implies message is written to disk log Removal of message from queue deletes message from disk log Trade-off performance versus reliability 81
MOM Server
Senders Sending Applications
Message Handler Thread Pool
MOM Server Receivers Sending Applications
Peer-to-peer MOM technologies are the alternative design 82
MOM Transactions
Begin transaction ... update database record put message on queue ... commit transaction Begin transaction ... get message from queue update database record ... commit transaction 83
MOM Transactions
Sender and receiver do *not* share a transaction Rollback on receiver does not affect the sender (already committed) ‘Synchronous’ operations are not atomic Request/response is 3 transactions not 1 Put to request queue Get from request queue, put to response queue Get from response queue Request queue send receive receive Response queue send 84
Scaling MOM
MOM Server ApplicationQ Senders MOM Server ApplicationQ Receivers
85
Messaging – Some thoughts
Highly attractive asynchronous technology Supports loosely-coupled, dynamic applications Scales well, high throughput possible Many implementations, various qualities of service caveat emptor 86
Publish-Subscribe Messaging
Extension of MOM to provide 1-to-N, N-to-1, and N-to N communications Messages are ‘published’ to logical
subjects
or
topics
Subscribers receive all messages from subjects they subscribe to Sub Pub
Create/ Publish
Subject
Register/ Subscribe
Sub Sub 87
Publish-Subscribe with Multicast
Subscriber rvd Subscriber rvd rvrd Publisher rvd rvrd
Based on TIBCO Rendezvous
rvd Subscriber rvd Subscriber rvd Subscriber
88
Performance
Milliseconds 700 600 500 400 300 200 100 0 10 20 30 40 50 No. Of Subscribers MC1 MC2 QB
89
Subject/Topic Naming
Sydney DevGroup SupportGroup Information Information work gossip Sydney Sydney/DevGroup Sydney/DevGroup/Information Sydney/DevGroup/Information/work Sydney/DevGroup/Information/gossip Sydney/SupportGroup Sydney/SupportGroup/Information Sydney/SupportGroup/Information/work Sydney/SupportGroup/Information/gossip work gossip Sydney/*/Information Sydney/DevGroup/*/* Sydney/DevGroup/** 90
Publish-Subscribe – Some Thoughts
Highly decoupled messaging style Publishers don’t know about subscribers Subscribers don’t know who is publishing Publishers and Subscribers can dynamically appear and disappear Issues – Reliability Transactions Security Performance 91
J2EE Overview
Client tier Web Tier Business Component tier EIS Tier Browser-based client applications (HTML, applets, DHTML/scripting) HTTP Web server Servlets, JSPs RMI Application components EJBs JCA Java client applications Java RMI Container Services Components eg. JTS, JMS ERPs, CRMs, Mainframe TP systems JDBC RDBMS CAS COM Bridge, RMI over IIOP Windows/COM client applications 92
J2EE Application Server
In J2EE, the application server container provides the execution environment for the J2EE-specific components EJBs Message-driven beans Connectors Container provides additional services for hosted components Transactions Security Directory Threading Connection pooling 93
EJB Container
EJB Container EJB Pool Application Server
Transaction Service Lifecycle Management Persistence Connection Pool Thread Pool Directory Service Security Service 94
Beans and State
EJB Container
Stateless bean pool state state state state
EJB Clients
Stateful beans state state state 95
Deployment Descriptors
96
J2EE – Some Thoughts
Standards-based, multiple vendors, portable Good open source technology available Quality of implementations varies considerably Java only, wide platform support Performance is good, but varies between suppliers Scalable, fail over support through clustering Good integration with web technologies Supports various design patterns, flexible but more complex (e.g. stateful beans/scalability, entity beans) Standards evolving, need to monitor 97
Message Brokers - Motivation
Key:
Message = Web Component In-format In-format Legacy System #1 In-format Legacy System #2 In-format Legacy System #3 In-format Legacy System #4 In-format Queue Read Message Transform Legacyformat API call 98
What if …
the common
In-format
changes?
message format any legacy system API changes?
any of the transformations needs modifying? 99
Alternative Solution
Key:
Message = L1-format Legacy System #1 L2-format Legacy System #2 Web Component In-format Message Broker L3-format Legacy System #3 L4-format Legacy System #4 Transformations in broker Simplified endpoints Decouples Web and legacy components 100
Message Brokers
Developed specifically for Enterprise Application Integration (EAI) Add new layers of functionality to MOM Message transformation Rules engine Intelligent routing Adapters Typically (but not necessarily) built on top of a MOM layer 101
Message Broker Features
Message transformation – transform between different source/target formats Graphical message format definition and mapping tools High performance transformation engines Message format repositories Intelligent routing Route messages based on message content Rules Engine Scripting language, built-in functions Application programming environment 102
Message Brokers
Input Messages Hub and Spoke Architecture Transformation Routing Rules Processing Output Messages 103
Example - WMQI
104
BizTalk Mapping Tool
105
Adapters
An adapter is a component that resides between the message broker and the source/target systems Simplify complexity of end system interface through an abstraction layer Thin adapters - simple wrappers Thick adapters Programmable Abstract representation of services and meta-data Centralized adapters co-located with broker Distributed adapters execute in own process and may be located with source/target system 106
Message Brokers – Some Thoughts
Embeds transformations/routing in broker Can get complex Possible scaling issues Need to replicate brokers Failure handling Lightweight, rarely designed to recover from failure Often proprietary technology Good open source, standards-based like Mule now available 107
Business Process Orchestration
Commonly known as workflow Aim is to automate business processes which need to access data and business logic across disparate back-end applications Builds on EAI to ensure business processes are executed in the defined order using the required data Builds on middleware providing: Process execution engine Visual process definition tools Process monitoring tools 108
Typical Scenario
Business process automation Siebel Accounts Payable Customer Purchasing Sales desk Accounts Receivable Oracle SAP Customer Receiving Credit Validation Shipping 109
Example - BizTalk
110
BPO Architecture
Adapter Message Broker Adapter Adapter 111
BPEL
Web Services standard for describing workflows Many design and execution tools Eg ActiveBPEL Version 2.0 is a significant improvement 112
Integration Issues – Point-to-Point
Point-to-Point evolution Spaghetti architecture, hard to modify potentially (N 2 -N) interfaces
1 business process = 4 interfaces 5 business processes = 20 interfaces
113
Broker Spaghetti
No free lunch … Just relocates the spaghetti
message broker
114
Enterprise Data Model
Source sends message to target with common message format as payload.
Target receives message and transforms common format into its own local data representation.
2xN transformations, no broker needed Getting agreement is the tough bit …
Enterprise Data Model
115
Summary
Middleware: makes building complex, distributed, concurrent applications simpler.
institutionalizes proven design practices by supporting them in off-the-shelf middleware technologies.
Architect’s job is to ‘mix n’match’ technologies to create appropriate solutions Analyze trade-offs Open-minded (no hammer/nail thinking) No good/evil, its just technology 116
Essential Software Architecture
Session 5: A Software Architecture Process 117
A Software Architecture Process
Architects must be versatile:
Work with the requirements team:
The architect plays an important role in requirements gathering by understanding the overall systems needs and ensuring that the appropriate quality attributes are explicit and understood.
Work with various application stakeholders:
Architects play a pivotal liaison role by making sure all the application’s stakeholder needs are understood and incorporated into the design.
Lead the technical design team:
Defining the application architecture is a design activity.
Work with the project management:
budgets, schedules Planning, estimates, 118
An Architecture Process
Highly iterative Can scale to small/large projects Determine Architectural Requirements Architecture Design Validation 119
Determine Architectural Requirements
Sometime called: architecturally significant requirements architecture use cases essentially the quality and non-functional requirements for a system. Functional Requirements Determine Architecture Requirements Stakeholder Requirements Architecture Requirements 120
Examples
A typical architecture requirement :
“Communications between components must be guaranteed to succeed with no message loss”
Some architecture requirements are constraints: “
The system must use the existing IIS-based web server and use Active Server Page to process web requests
” Constraints impose restrictions on the architecture and are (almost always) non-negotiable. They limit the range of design choices an architect can make. 121
Quality Attribute Requirements
Quality Attribute
Performance Security Resource Management Usability Availability Reliability Scalability Modifiability
Architecture Requirement
Application performance must provide sub-four second response times for 90% of requests.
All communications must be authenticated and encrypted using certificates.
The server component must run on a low end office-based server with 512MB memory.
The user interface component must run in an Internet browser to support remote users.
The system must run 24x7x365, with overall availability of 0.99.
No message loss is allowed, and all message delivery outcomes must be known with 30 seconds The application must be able to handle a peak load of 500 concurrent users during the enrollment period.
The architecture must support a phased migration from the current Forth Generation Language (4GL) version to a .NET systems technology solution.
122
Constraints
Constraint
Business Development
Architecture Requirement
The technology must run as a plug-in for MS BizTalk, as we want to sell this to Microsoft.
The system must be written in Java so that we can use existing development staff.
Schedule Business The first version of this product must be delivered within six months.
We want to work closely with and get more development funding from
MegaHugeTech Corp
, so we need to use their technology in our application.
123
Priorities
All requirements are not equal
High:
the application must support this requirement.
Medium:
stage this requirement will need to be supported at some
Low:
this is part of the requirements wish list. Tricky in face of conflicts, eg: Reusability of components in the solution versus rapid time-to-market. Making components generalized and reusable always takes more time and effort.
Minimal expenditure on COTS products versus reduced development effort/cost. COTS products mean you have to develop less code, but they cost money.
It’s design – not meant to be easy!
124
Architecture Design
Design steps are iterative Risk identification is a crucial output of the design Architecture Views Architecture Requirements Choose Architecture Framework Allocate Components Architecture Document 125
Choosing the Architecture Framework
Choose a architecture pattern/patterns that suit requirements No magic formula Analyze requirements and quality attributed supported by each pattern Complex architectures require creative blending of multiple patterns.
126
N-Tier Client Server Pattern
Separation of concerns:
Presentation, business and data handling logic are clearly partitioned in different tiers.
Synchronous communications:
Communications between tiers is synchronous request-reply. Each tier waits for a response from the other tier before proceeding.
Flexible deployment:
There are no restrictions on how a multi-tier application is deployed. All tiers could run on the same machine, or each tier may be deployed on its own machine. Client Tier Web Server Tier Business Logic Tier Data Management Tier Web Client Web Client Web Server Application Server Databases Web Client 127
N-Tier Client Server – Quality Attribute Analysis
Quality Attribute
Availability
Issues
Failure handling Modifiability Performance Scalability Servers in each tier can be replicated, so that if one fails, others remain available. Overall the application will provide a lower quality of service until the failed server is restored.
If a client is communicating with a server that fails, most web and application servers implement transparent failover. This means a client request is, without its knowledge, redirected to a live replica server that can satisfy the request.
Separation of concerns enhances modifiability, as the presentation, business and data management logic are all clearly encapsulated. Each can have its internal logic modified in many cases without changes rippling into other tiers.
This architecture has proven high performance. Key issues to consider are the amount of concurrent threads supported in each server, the speed of connections between tiers and the amount of data that is transferred. As always with distributed systems, it makes sense to minimize the calls needed between tiers to fulfill each request.
As servers in each tier can be replicated, and multiple server instances run on the same or different servers, the architecture scales out and up well. In practice, the data management tier often becomes a bottleneck on the capacity of a system.
128
Messaging Pattern
Asynchronous communications:
Clients send requests to the queue, where the message is stored until an application removes it.
Configurable QoS:
The queue can be configured for high-speed, non-reliable or slower, reliable delivery. Queue operations can be coordinated with database transactions.
Loose coupling:
direct binding between clients and servers. There is no
Client Client Client Que ue Server Server Server
129
Messaging – Quality Attribute Analysis
Quality Attribute
Availability Failure handling Modifiability Performance Scalability
Issues
Physical queues with the same logical name can be replicated across different messaging server instances. When one fails, clients can send messages to replica queues.
If a client is communicating with a queue that fails, it can find a replica queue and post the message there. Messaging is inherently loosely coupled, and this promotes high modifiability as clients and servers are not directly bound through an interface. Changes to the format of messages sent by clients may cause changes to the server implementations. Self-describing, discoverable message formats can help reduce this dependency on message formats.
Message queuing technology can deliver thousands of messages per second. Non reliable messaging is faster than reliable, with the difference dependent of the quality of the messaging technology used.
Queues can be hosted on the communicating endpoints, or be replicated across clusters of messaging servers hosted on a single or multiple server machines. This makes messaging a highly scalable solution.
130
Publish-Subscribe Pattern
Many-to-Many messaging:
Published messages are sent to all subscribers who are registered with the topic.
Configurable QoS:
In addition to non-reliable and reliable messaging, the underlying communication mechanism may be point-to point or broadcast/multicast.
Loose Coupling:
and subscribers. As with messaging, there is no direct binding between publishers
Publisher Topic Subscriber Subscriber Subscriber
131
Publish-Subscribe – Quality Attribute Analysis
Quality Attribute Issues
Availability Failure handling Topics with the same logical name can be replicated across different server instances managed as a cluster. When one fails, publishers send messages to replica queues.
If a publisher is communicating with a topic hosted by a server that fails, it can find a live replica server and send the message there. Modifiability Publish-subscribe is inherently loosely coupled, and this promotes high modifiability. New publishers and subscribers can be added to the system without change to the architecture or configuration. Changes to the format of messages published may cause changes to the subscriber implementations.
Performance Scalability Publish-subscribe can deliver thousands of messages per second, with non reliable messaging faster than reliable. If a publish-subscribe broker supports multicast/broadcast, it will deliver multiple messages in a more uniform time to each subscriber.
Topics can be replicated across clusters of servers hosted on a single or multiple server machines. Clusters of server can scale to provide very high message volume throughput. Also, multicast/broadcast solutions scale better than their point-to-point counterparts.
132
Broker Pattern
Hub-and-spoke architecture:
The broker acts as a messaging hub, and senders and receivers connect as spokes.
Performs message routing:
The broker embeds processing logic to deliver a message received on an input port to an output port.
Performs message transformation:
The broker logic transforms the source message type received on the input port to the destination message type required on the output port.
Sender-1 Sender-2
inPort1 inPort2
Broker
OutPort1
Receiver-1
OutPort2
Receiver-2
133
Broker Pattern - Quality Attribute Analysis
Availability
Quality Attribute
Failure handling Modifiability Performance Scalability
Issues
To build high availability architectures, brokers must be replicated. This is typically supported using similar mechanisms to messaging and publish-subscribe server clustering.
As brokers have typed input ports, they validate and discard any messages that are sent in the wrong format. With replicated brokers, senders can fail over to a live broker should one of the replicas fail.
Brokers separate the transformation and message routing logic from the senders and receivers. This enhances modifiability, as changes to transformation and routing logic can be made without affecting senders or receivers.
Brokers can potentially become a bottleneck, especially if they must service high message volumes and execute complex transformation logic. Their throughput is typically lower than simple messaging with reliable delivery.
Clustering broker instances makes it possible to construct systems scale to handle high request loads.
134
Process Coordinator Pattern
Process encapsulation:
The process coordinator encapsulates the sequence of steps needed to fulfill the business process. The sequence can be arbitrarily complex.
Loose coupling:
The server components are unaware of their role in the overall business process, and of the order of the steps in the process.
Flexible communications:
Communications between the coordinator and servers can be synchronous or asynchronous. Start process request step1
Server-1 Process Coordinator
step2 step3
Server-2
Process results
Server-3
step4
Server-4
135
Process Coordinator – Quality Attribute Analysis
Quality Attribute Issues
Availability The coordinator is a single point of failure. Hence it needs to be replicated to create a high availability solution.
Failure handling Modifiability Failure handling is complex, as it can occur at any stage in the business process coordination. Failure of a later step in the process may require earlier steps to be undone using compensating transactions. Handling failures needs careful design to ensure the data maintained by the servers remains consistent.
Process modifiability is enhanced because the process definition is encapsulated in the coordinator process. Servers can change their implementation without affecting the coordinator or other servers, as long as their external service definition doesn’t change.
Performance Scalability To achieve high performance, the coordinator must be able to handle multiple concurrent requests and manage the state of each as they progress through the process. Also, the performance of any process will be limited by the slowest step, namely the slowest server in the process.
The coordinator can be replicated to scale the application both up and out. 136
Allocate Components
Need to: Identify the major application components, and how they plug into the framework.
Identify the interface or services that each component supports.
Identify the responsibilities of the component, stating what it can be relied upon to do when it receives a request.
Identify dependencies between components.
Identify partitions in the architecture that are candidates for distribution over servers in a network And independent development 137
Some Design Guidelines
Minimize dependencies between components. Strive for a loosely coupled solution in which changes to one component do not ripple through the architecture, propagating across many components. Remember, every time you change something, you have to re test it.
Design components that encapsulate a highly “cohesive” set of responsibilities. Cohesion is a measure of how well the parts of a component fit together. Isolate dependencies on middleware and any COTS infrastructure technologies. Use decomposition to structure components hierarchically. Minimize calls between components, as these can prove costly if the components are distributed. 138
A Simple Design Example
read Write Order
OrderInput
New Orders Check Order Write Order Get Order
SendEmail Validate Store
Error Log
Customer System Order System Email Server Figure Key
Existing Component New Component Dependency Database Persistent Queue 139
Example Design
Based on messaging Application components are:
OrderInput:
encapsulating the order processing logic, and writing to the queue.
responsible for accessing the new orders database,
Validate:
encapsulates the responsibility of interacting with the customer system to carry out validation, and writing to the error logs if an order is invalid.
Store:
responsibility of interacting with the order system to store the order data.
SendEmail:
removes a message from the queue, formats an email message and sends it via an email server. It encapsulates all knowledge of the email format and email server access.
Clear responsibilities and dependencies 140
Architecture Validation
Aim of the validation phase is to increase confidence of the design team that the architecture is fit for purpose. The validation has to be achieved within the project constraints of time and budget The trick is to be as rigorous and efficient as possible.
Validating an architecture design poses tough challenges.
‘coz it’s a design that can’t be executed or tested consists of new and COTS components that have to be integrated Two main techniques: 1.
2.
manual testing of the architecture using test scenarios. construction of a prototype that creates a simple archetype of the desired application aim of both is to identify potential flaws in the design so that they can be improved before implementation commences. Cheaper to fix before built 141
Scenarios
Part of SEI’s ATAM work Involves defining: some kind of stimulus that will have an impact on the architecture. working out how the architecture responds to this stimulus. If the response is desirable, then a scenario is deemed to be satisfied by the architecture. If the response is undesirable, or hard to quantify, then a flaw or at least an area of risk in the architecture may have been uncovered.
142
Scenario Examples
Stimulus Quality Attribute
Availability Modifiability Security Modifiability Scalability
Response
The network connection to the message consumers fails. A new set of data analysis components must be made available in the application.
No requests are received on a user session for ten minutes. The supplier of the transformation engine goes out of business.
The concurrent user request load doubles during the 3 week enrollment period.
Messages are stored on the MOM server until the connection is restored. Messages will only be lost if the server fails before the connection comes back up.
The application needs to be rebuilt with the new libraries, and the all configuration files must be updated on every desktop to make the new components visible in the GUI toolbox.
The system treats this session as potentially insecure and invalidates the security credentials associated with the session. The user must logon again to connect to the application.
A new transformation engine must be purchased. The abstract service layer that wraps the transformation engine component must be re implemented to support the new engine. Client components are unaffected as they only use the abstract service layer.
The application server is scaled out on a two machine cluster to handle the increased request load. 143
Scenarios for Order Processing Example
Quality Attribute
Modifiability Availability
Stimulus Response
The
Customer System
application is updated to an Oracle database.
packaged The
Validate
component must be rewritten to interface to the Oracle system.
The email server fails.
Messages build up in the email server restarts. Messages are then sent by the not affected.
OrderQ SendEmail
until the component to remove the backlog. Order processing is
Reliability The Customer or Order systems are unavailable.
Needs fixing ….
If either fails, order processing halts and alerts are sent to system administrators so that the problem can be fixed.
144
Prototyping
Scenarios can’t address everything:
“On Friday afternoon, orders must be processed before close of-business to ensure delivery by Monday. Five thousand orders arrive through various channels (Web/Call centre/business partners) five minutes before close-of business.”
Only one way – build something!
Proof-of-concept prototype:
designed be built in a way that can satisfy the requirements?
Can the architecture as
Proof-of-technology prototype :
expected?
Does the technology (middleware, integrated applications, libraries, etc) selected to implement the application behave as 145
Prototyping Strategy
Build minimal system required to validate architecture, eg: An existing application shows that the queue and email systems are capable of supporting five thousand messages in five minutes So: Write a test program that calls the
Customer System
validation APIs five thousand times, and time how long this takes.
Write a test program that calls the
Order System
store APIs five thousand times, and time how long this takes.
146
Prototyping Thoughts
Prototypes should be used judiciously to help reduce the risks inherent in a design. Only way to address: Performance Scalability Ease of integration Capabilities of off-the-shelf components Need to be carefully scoped and managed. Ideally take a day or two, a week or two at most. Usually thrown-away so keep them cheap Don’t let them acquire a life of their own 147
Summary
3 step, iterative architecture design process Can be customized to small/meduim/large projects Agnostic to overall process framework (ie RUP, agile, waterfall, etc) 148
Essential Software Architecture
Session 6: Documenting a Software Architecture 149
Architecture Documentation
Architecture documentation is a thorny issue Commonly there is no documentation covering the architecture. If it is, it’s out-of-date, inappropriate and basically not very useful. Also projects that have masses of architecture related information Sometimes invaluable, but often it’s out-of-date, inappropriate and not very useful! 150
Documenting an Architecture is good!
Others can understand/evaluate the design. We can understand the design after a period of time.
Others in the project team and development organization can learn from the architecture.
We can do analysis on the design, perhaps to assess its likely performance, or to generate standard metrics.
151
But it’s difficult …
No universally accepted architecture documentation standard. An architecture can be complex, and documenting it in a comprehensible manner is time consuming and non-trivial. An architecture has many possible views. Documenting all the potentially useful ones is time consuming and expensive.
An architecture design often evolves Keeping the architecture documents current is often forgotten, especially with time and schedule pressures in a project.
152
Think carefully about what to document
Project complexity A small project may only need a ‘marketecture’ Project longevity One-off stop gap software?
Strategic, long-term, will evolve?
Needs of stakeholders Small team, a whiteboard might be ok Large, dislocated team needs more Integrators? Testers? Programmers?
Need to spend documentation dollars/euros wisely on high value products 153
UML 2.0
UML is a powerful way to document an architecture Provides a relatively formal, unambiguous description New features in UML 2.0 appropriate for architectures Good tools available, some free Can be used to depict various structural/behavioral architecture views 154
Component Diagram
id Component View OrderProcessing
«table»
NewOrders
read 1 writeQ 1
MailQueue
validate 1 1
CustomerSystem
1 writeQ 1
OrderQueue
readQ 1
SendEmail
1 send 1
MailServer OrderSystem
155
Class Diagram
cd OrderProcessing OrderReader
1 1 1 1
Validate
1
QueueWriter
1
Store
156
Sequence Diagram
sd Interactions OrderReader
NewOrders readOrderData success:= validateOrder
Validate
success:= newOrder success:= storeOrder CustomerSystem
Store
success:= writeQueue success:= acknowledgeOrderSuccess OrderQueue
QueueWriter
success:= writeQueue MailQueue 157
Deployment Diagram
dd Deployment View OrderServer OrdersDB
«table»
:NewOrders
JDBC «executable»
: OrderProcessing
SOAP
CustomerSystem CRM
«executable»
: CustomerSystem MOMServer :MailQueue ERP OrderSystem
«executable»
: OrderSystem :OrderQueue MailServer :MailServer :SendEmail
158
Component Interfaces
id Component View OrderProcessing
«table»
NewOrders
JDBC QueueWrite
MailQueue
QueueWrite QueueRead
SendEmail
CustomerServices
OrderQueue
QueueRead SMTP
MailServer CustomerSystem OrderSystem
159
Component Decomposition
id Component View
«table»
NewOrders
JDBC
OrderProcessing
QueueWrite getOrders writeConfirmation validateOrder writeOrder
MailQueue
QueueRead QueueWrite
SendEmail
CustomerServices
OrderQueue
QueueRead SMTP
MailServer
validate
CustomerSystem OrderSystem
ProvidedInterface1
cd Component View OrderProcessing
«delegate»
or: OrderReader qw: QueueWriter
«delegate» getOrders writeConfirmation
val: Validate st: Store
«delegate» validateOrder «delegate» writeOrder 160
Document Template
Documentation is easier if there’s a template to use Reduces start-up time for projects by providing ready-made document structures familiarity gained with the document structure aids in the efficient capture of project design details. help with the training of new staff 161
Template Headings
3 3.1
3.2
3.3
3.4 3.5
4 4.1 4.2
Architecture Documentation Template Project Name: XXX 1 Project Context 2 2.1
Architecture Requirements Overview of Key Objectives 2.2
2.3
2.4
2.5
2.6
Architecture Use Cases Stakeholder Architectural Requirements Constraints Non-functional Requirements Risks Solution Relevant Architectural Patterns Architecture Overview Structural Views Behavioral Views Implementation Issues Architecture Analysis Scenario analysis Risks 162
Summary
Some documentation is nearly always a good idea Trick is to produce ‘just enough’ and no more requires upfront planning and thinking Commitment to keeps docs current UML 2.0 makes architecture documentation easier Some good UML 2.0 tools, try ‘em out.
163
Essential Software Architecture
Session 7: ICDE Case Study Design 164