A review of IP traffic on its effect within SAN

This brings iSCSI to mind. According to Network Magazine, iSCSI makes SAN data accessible to everyone, as this enables a node to access a storage target and set it up for access so that it appears to the OS as a local drive. The advantage of an IP transport in comparison to Fiber Channel is that it is possible to scale a SAN over a longer distance. “TCP/IP can run over any physical medium, so it offers a choice of topologies… and IP packets can travel across alternative LAN topologies just as easily ” (Hall, 2004). This can happen only if the network hardware is sufficient to support the additional Gigabit traffic that iSCSI causes, as iSCSI produces large amounts of traffic bursts. As TCP contains flow control that is very sensitive to loss and delay of packets which could result in negative performance impact, it is essential to implement the fastest switching and routing protocols such as Gigabit switches and router RIP. Currently, there are new 10Gbs Ethernet switches on the market that can more than handle the increase traffic bursts of iSCSI such as Intransa’s 10G Ethernet storage appliance that is designed for the demands of video on-demand, IPTV, digital imaging, video surveillance, and data mining” (Messmer, 2009) which demand those higher speeds. Let’s not forget the backup of data within the IP SAN. Data backup is the cause of a large amount of SAN traffic both on the wire and on the storage targets. Using a LAN-free approach, such as having a backup server within the SAN would eliminate much (IP) backup traffic so that the backup data is not flowing between the client node and the backup server, but only between the backup server and the storage target. As stated in Arkeia’s article on its new backup software, Arkeia Network Backup version 6.0, the “…Server for SAN option allows LAN-free backup for SAN environments, thus easing network congestion. It allows direct SAN connectivity of servers, storage arrays and tape libraries” (Connor, 2006).

References:

iSCSI: Storage Networking Sans the SAN? — Farewell to Fibre Channel? iSCSI lets storage traffic run over a regular TCP/IP network.(Internet SCSI)(Storage Area Networks)(Transmission Control Protocol/Internet Protocol).Eric A. Hall. Network Magazine (May 1, 2004): p62. (3170 words) 

Messmer, E. (2009). Intransa debuts 10G Ethernet storage appliance; Latest StorStac appliance offers up to 180TB of storage capacity. Network World. Academic OneFile. Gale. BCR Regis University. 
http://find.galegroup.com.dml.regis.edu/itx/start.do?prodId=AONE

Connor, D. (2009). Arkeia rolls out new backup software. Network World. Academic OneFile. Gale. BCR Regis University.
http://find.galegroup.com.dml.regis.edu/itx/start.do?prodId=AONE

Advertisements

IP as a viable transport for storage networks

netwrokIP is a viable transport method within SAN’s, as this is a great method for attaching additional client nodes to the SAN without the need of purchasing and installing additional (and expensive) HBA’s for the nodes. iSCSI is one of the tools that enables this to happen. With the use of a software iSCSI initiator, the node can view an assigned iSCSI file system on the storage end and attach to it as a local disk.

Another advantage of IP within a storage network is the utilization of the spanning tree protocol within TCP. Spanning tree monitors redundant links and assigns alternative network paths on demand in the event of a port failure. It also prevents any packet stream from accessing more than one path at a time, preventing packet or broadcast flooding in the network. Think of it as a traffic cop. Spanning Tree is valuable for SAN traffic, as it will keep the data going in the event of a link failure. This is known as transparent bridging in the text (p.81). With Fiber Channel, there is usually a second fiber switch that will enable continuity of the connection if the other goes down. It seems to me that IP can provide more links to the target.

Spanning Tree is being replaced presently in a few organizations by the new Flex Link protocol:

“Flex Link is a Layer 2 availability feature that can co-exist with spanning tree. This enhancement allows a convergence time of less than 50 milliseconds. In addition, this convergence time remains consistent regardless of the number of VLANs or MAC addresses configured on switch uplink ports. It is a pair of a Layer 2 interfaces, either switchports or port channels, that are configured to act as a backup to another Layer 2 interface. The feature provides an alternative solution to the spanning tree protocol (STP), and it allows users to turn off STP and still provide basic link redundancy”

Reference:

Cisco. (2009). How to configure a Flex Link for link-level redundancy in Cisco Catalyst switches that run Cisco IOS. Retrieved July 16, 2009 from http://supportwiki.cisco.com/ViewWiki/index.php/How_to_configure_a_Flex_Link_for_link-level_redundancy_in_Cisco_Catalyst_switches_that_run_Cisco_IOS

Data mining effects on SAN

dataminingToday’s e-business and scientific relational databases place a huge impact on SAN performance, especially within data mining. Data mining is the modern tool of the acquisition of demographic information on consumer buying habits. Within large e-business organizations, there is a constant demand on acquired customer demographic data: this data is examined within the database for buying trends and patterns to determine marketing and advertising strategy. This data is also compiled and sold. Within the scientific community, data mining is used within genetics. “A wealth of genomic information in the form of publicly available databases is underutilized as a potential resource for uncovering functionally relevant markers underlying complex human traits. Given the huge amount of SNP (single nucleotide polymorphism) data available from the annotation of human genetic variation, data mining is a reasonable approach to investigating the number of SNPs that are informative for ancestry information.” (Baye, et al, 2009). Population mapping is a large part of this database querying within data mining, and this places a large demand on SAN architecture, especially throughput and selection of communications medium such as fiber and data transport protocols.

For a SAN to provide the desired performance under this kind of load from data mining, it would require the implementing of good monitoring applications and smart storage management suites, which are available from the hardware vendors. One example of a data management suite is EMC’s Networker, an automated data accelerator: “…Networker backup software, you get a common platform that supports a wide range of data protection options including backup to disk, replication management, continuous data protection, and de-duplication across physical and virtual environments…” (Networker, 2009).

De-duplication is another process that is essential to data organization within storage, and there exist many applications that provide this valuable service; although it does cause more system overhead.

References:

Baye, T M, Tiwari, H K, Allison, D B, & Go, R C (Feb 14, 2009). Database mining for selection of SNP markers useful in admixture mapping.(Research)(single nucleotide polymorphisms)(Report). BioData Mining, 2, 1. p.1. Retrieved June 05, 2009, from Academic OneFile via Gale:
http://find.galegroup.com.dml.regis.edu/itx/start.do?prodId=AONE

Networker. (2009). EMC. Retreived June 5, 2009 from http://www.emc.com/products/detail/software/networker.htm