SATA in the Enterprise Willis Whittington Sr Mgr Interface Planning Seagate Technology Jeff Mastro Program Manager Microsoft Corporation [email protected] [email protected].
Download ReportTranscript SATA in the Enterprise Willis Whittington Sr Mgr Interface Planning Seagate Technology Jeff Mastro Program Manager Microsoft Corporation [email protected] [email protected].
SATA in the Enterprise Willis Whittington Sr Mgr Interface Planning Seagate Technology Jeff Mastro Program Manager Microsoft Corporation [email protected] [email protected] Session Outline Serial ATA Overview SATA Goals and Objectives Interface Enhancements and Features Compatibility with SAS SATA Enterprise Applications Market Segmentation & Tiered Storage “Nearline” Caveats for SATA in High End Applications Error Recovery & Data Integrity Reliability, Workload, Duty Cycles Environmental Considerations Q&A Session Goals Provide some insight into the use of high capacity low cost storage in a tiered storage environment. Understand the need for a new class of “Nearline” products which provide storage for reference data and infrequently accessed data such as financial compliance archives. Nearline High capacity, low cost storage of reference data and data protection copies that are infrequently accessed, such as financial compliance archives, medical records and disc-to-disc backup An appreciation for the challenges of using low cost storage in an Enterprise computing environment Storage Devices: Key Windows Strategies Storage Fabrics Server/Enterprise Personal Storage Client/Consumer Optical Platform Client/Consumer Preferred Storage Platform Partner/Customer Leading platform supporting storage fabrics Optimized platform features enabling new customer scenarios in Personal Storage Timely, comprehensive, quality platform support for optical devices Preferred platform for developing, integrating, and deploying storage devices Serial ATA Overview New topologies, new features, low cost. Moving from Parallel ATA to Serial ATA Summary of Goals and Objectives Primary inside the box storage connection S/w transparent to ATA Better cabling (1m) and connectors and a lower pin count PATA (18”) Higher performance, scalability and ~10 year roadmap 150 300 600 MB/sec Support for 1st Party DMA SATA (1m) Target can initiate data transfers to host memory Plug and Play with no jumpers or terminators Connecting to the I/O Connectors and cables are required to: Support 1.5 Gbps with headroom for 3.0 Gbps speed Be cost competitive to Ultra ATA Facilitate a smooth transition to Serial ATA Be suitable for both 3½” and 2½” storage devices Support up to 1 meter cable length Be blind-mateable and hot-pluggable (staggered pins) Data Rx Tx 3.3V Voltage Pins 5V R 12V Key Gnd Gnd Additional SATA II Features Backplane Interconnect Staggered spindle spin up SATA Native Command Queuing Performance improvement Non 512 byte block size Multiple block size support is not financially viable Port Multiplier Provides multi drive access to a host Port Selector Provides a failover path to a drive Port Multiplier Connects 1 Host to any one of 15 (max) SATA devices JBOD Host command selects Target device SATA Avoids changing cables to change Targets Port Contains switching/processing Multiplier On electronics backpanel Requires HBA support Needed to support multi-drive cabinets if no SAS Expanders available SATA Port Selector Provides an alternate path to the Target in the event of a failure Switching between hosts is tantamount to a cable change at the drive (I/O Initialization required) Active Connection Not to be confused with Dual Port Port Selector Passive Connection (1 per Drive) SATA Drive Compatibility with Serial Attached SCSI (SAS) SATA is plug compatible with SAS Connector Flip Side Port B SAS Keyway SATA SATA Pluggable SAS ☻ Compatibility with Serial Attached SCSI (SAS) SATA Compatibility with SAS provides distinct user advantages SATA drives are plug compatible with SAS backplanes SATA & SAS drives may be freely intermixed within an enclosure Serial Tunneling Protocol (STP) allows a SAS Host to communicate directly with a SATA Target SATA devices are addressable in a SAS Expander topology STP SATA SSP Create Tunnel to Target Drive Serial SATA Serial from SCSI Host Tunneling Protocol to Target Protocol SAS Drives SATA Drives SATA Enterprise Applications Nearline Storage: High Capacity, Low Cost Opportunities Nearline Disc Drives “Only a small percentage of data should be maintained on the highest cost, high performance devices. The rest is migrated to less expensive storage where it is immediately available when needed.” EMC Whitepaper April 2005 Transactional Reference Data Staging & spooling Archive On Line Near Line Off Line Performance Availability Access time High capacity Low cost-per-gigabyte High reliability Optimized for nearline applications Cost per Terabyte Capacity Retention Workload Descriptions Transactional Database (ES) Separately attached storage supporting 1000s of users Mission Critical Applications e.g. Bank records, ticketing, etc. General Purpose Server (ES) Web, Application, E-Mail server for 10-100 users Small databases and RAID. Nearline High capacity, low cost storage Reference data, Sarbanes---Oxley Disc-to-disc backup. Low Cost Server (PS) Departmental filing, printing, applications Small web or E-mail server Desktop (PS) Average office user E-Mail, browsing, reading / writing documents, printing. Typical Workloads by Application Nearline workloads are unique compared to online and low-cost server workloads Online Highly Transactional Database General Purpose Server Nearline Reference data Highly random High IOPS 24x7 power-on hours Disc-to-disc backup Highly random Low Cost Server Low IOPS 24x7 power-on hours Desktop High duty cycle Low-Cost Server Low duty cycle 8x5 power-on hours More sequential Frequent start/stop Low duty cycle Data Is Changing – and So Is Storage Worldwide Reference Information Capacity Forecast Server attached Disk Storage Worldwide, Digitized Reference Information created and stored will surpass NonReference Information by the end of 2006 2,500 2,250 2,000 Petabytes 1,750 1,500 1,250 1,000 750 500 250 0 2001 2002 Non-Reference Information Source: Enterprise Storage Group 2003 2004 Reference Information 2005 2006 Total Storage Capacity Caveat The right storage for the right application Nearline/Enterprise Storage Differences Cost Capacity Duty Cycle Reliability Scalability Environment Nearline Storage 7,200 rpm SATA Performance Data Integrity Power Consumption System Architecture Enterprise Storage 15,000 rpm SAS/FC Each application gets the parameters it needs Nearline / Enterprise Drive Differences Electronics Motor Discs Head Misc Mechanical Stack Dual Higher More Larger processors platters heads rpm Magnets Multi Less Smaller Low Stiffer mass, host runout Covers diameter high rigidity Dual More Full Higher Air Control media port expensive costcertification Devices design Twice Fully Fastercharacterized the Seeks firmware High Low RV rpmsusceptibility control & RPS Command scheduling Superior error correction Smart servo algorithms Perform. optimization Data integrity checks ATA Workload vs. Failure Rate Effect of Duty Cycle on ATA Drive Failures Cum % Failures %Failures Cumulative 8.0% Low End Server (Random) 7.0% 6.0% 5.0% High Duty Cycle (Sequential) 4x 4.0% 3.0% 2x Desktop Workload 2.0% 1.0% 0.0% 0 100 200 300 400 500 600 700 800 900 1000 1100 1200 Test Time, Hours HDT HDT_2 Desktop1 3 groups of 300 Desktop drives were tested in commercial test chambers All workloads were captured from exhaustive Field analyses In repeated testing, greater than 2x higher failure rates were recorded with Enterprise workload compared to Desktop UER* on Very High Capacity RAID Sets The UER for SATA products is 1 in 1014 bits read 1014 bits = 12½ terabytes A 500 Gbyte drive has 1/25 x 1014 bits Rebuilding a SATA drive in a RAID 5 set of 5 drives means transferring 5/25 x 1014 bits This means there is a 20% probability of an Unrecoverable Error during the rebuild. Better odds would be available with RAID 1 or 6 Higher redundancy, faster Rebuild, more expensive Risks can be reduced with good error management Intelligent rebuild (e.g. ignore unused capacity) Background media scan *Unrecoverable Error Rate Nearline UER Comparison to Enterprise Probability of Unrecoverable Errors during RAID Rebuild 20% Probability of UE 16% 12% SATA NL Drives UER = 10-14 8% UER = 10-15 in 2007 4% 3 4 5 # of 500 GB Drives in RAID Set Enterprise Drives UER = 10-15 UER = 10-16 in 2007 Alternate Sector Size – Not an Easy Proposition Standard 512 byte Format Servo Data 1 Data 2 Data 3 Servo Data 4 Non-Standard Format (allows for post amble check field) Servo Data 1 Data 2 Data 3 Servo Data 3 Data 4 Industry has standardized on 512 byte sectors H/w and s/w optimized for efficiency, cost and µP loading Manufacturing geared up for “vanilla” processes and production Problems with changing the sector size: More buffering required to handle the additional data More complex f/w requiring additional memory storage More complex split sector handling/computation required More complicated/costly media flaw mapping process Physical vs Logical Flaw Mapping Flaw somewhere in LBA 9 Flaw in LBA 7 or LBA 8 Flaw @ Cyl x, Hd y, Byte z Cyl Flaw in LBA 7 Index Logical flaw location is format dependent LBA can be computed from the physical loc. for any media format Physical media certification is a complicated operation Complex algorithm used Time consuming Manuf. Process Costly Reformat to 528 512 byte sectors byte sectors End to End Data Checking 520 Byte Sector Data Frame SOF Header User Logical Block Logical Block Application Guard Tag 2 Bytes 8 Bytes 512 Bytes 2 Bytes End to End CRC EOF Data Check Data Logical Block Reference Tag 4 Bytes Not available on SATA drives with fixed 512 byte sectors Enterprise Drive Data Integrity Provisions I/O Interface Electronics Disk Formatter Customer Data (512) + CRC Customer Data + Err. Det.Check + PBA Check 512 bytes data + 40 check bytes Data Buffers Customer Data + Err. Det.Check + LBA Check LBA = Logical Block Address PBA = Physical Block Address Protection against hardware failures in the data path Effectively eliminates the risk of unreported “data miscompare” errors Rotational Vibration Queue = 4 200 Mutual Interference 180 Increases seek time 160 Takes longer to settle on track 140 Actuator vibrates off track Enterprise 120 Performance (IO/sec) IO/s Read retries Aborted Writes Enterprise Requirement 100 80 60 40 20 Desktop 0 0 20 40 60 -20 Rotational Radians/sec**2 Vibration (Rads/sec2) 80 Rotational Vibration Average RV recorded on 33 Enterprise Cabinets Very Bad 40 R V (Rads/sec2) 35 30 25 20 Enterprise 15 10 5 Very Good 0 Desktop Cabinets Tested More stringent RV spec. needed for SATA cabinets RV aggravated by random operation and “bursty” workloads Summing Up …. SATA enables a Tiered Storage approach to data management by providing a high capacity low cost solution for what has come to be called the Nearline segment of the market. In the hierarchy of storage, Nearline provides continuity by filling, and overlapping, the slot between high end Personal Storage and Enterprise Although technological advances, driven by Enterprise research, will be leveraged into SATA products, there will continue to be functional limitations imposed on these devices by the overriding metric of low $/GB storage. and in Conclusion… Call To Action Use RAID 1 or RAID 6 in SATA Array Uncorrected Error Rate Mitigation Extra cost justified by value derived Rotational Vibration Consideration Exaggerated by Random Disk Operations Deploy SATA drives in a high-quality cabinet Extra cost justified by value derived Remember the Duty Cycle Optimal Nearline Drive Application Sequential Read Operations (WORM) Compliance, Email, Customer Record Archive Don’t Expect Desktop Drive to Handle Enterprise Load Additional Resources Web Resources: Specs: http://www.t13.org/#Standards Whitepapers: http://www.microsoft.com/whdc/device/storage/serialATA_FAQ.mspx http://www.sata-io.org/docs/serialata%20%20a%20comparison%20with%20ultra%20ata%20technology.pdf http://www.lsil.com/files/docs/marketing_docs/storage_stand_prod/Te chnology/SATA_II_White_Paper.pdf Questions on this topic [email protected] General WinHEC 2005 Comments [email protected]