Database High Availability Simplified

FlashGrid Storage Fabric

Shared storage is critical for seamless failure handling with zero downtime and zero data loss. FlashGrid Storage Fabric software enables high-speed shared storage in a variety of infrastructure environments including public cloud, bare-metal servers, virtual machines, or extended distance clusters. FlashGrid Read-Local™ Technology minimizes network overhead and enables storage speeds higher than with flash arrays.

Architecture Highlights

  • Shared storage based on local storage devices
  • Support for various storage device types: NVMe SSD, SAS SSD, virtual disks
  • Storage devices attached to compute nodes (hyper-converged) or in separate storage nodes
  • Standard x86 servers, on-premise VMs, or public cloud VMs used as compute and storage nodes
  • Fully distributed architecture with no single point of failure
  • Mirroring of data across nodes, data centers, or cloud availability zones
  • Network connectivity using commodity Ethernet
  • FlashGrid Read-Local Technology minimizes network overhead by serving reads from local storage devices
  • Seamless integration with Oracle ASM and Clusterware

Shared Access

FlashGrid Storage Fabric makes every storage device accessible from every database node in the cluster.

High Availability and Data Mirroring

FlashGrid has a fully distributed architecture with no single point of failure. FlashGrid leverages Oracle ASM’s existing capabilities for mirroring data. In Normal Redundancy mode each block of data has two mirrored copies. In High Redundancy mode each block of data has three mirrored copies. Each ASM disk group is divided into failure groups – one failure group per node. Each disk is configured to be a part of a failure group that corresponds to the node where the disk is physically located. ASM makes sure that mirrored copies of a block are placed on different failure groups. In Normal Redundancy mode the cluster can withstand loss of one (converged or storage) node without interruption of service. In High Redundancy mode the cluster can withstand loss of two (converged or storage) nodes without interruption of service.

FlashGrid Read-Local™ Technology

In converged clusters the read traffic can be served from local SSDs at the speed of the PCIe bus instead of travelling over the network. In 2-node clusters with 2-way mirroring or 3-node clusters with 3-way mirroring 100% of the read traffic is served locally because each node has a full copy of all data. Because of the reduced network traffic the write operations are faster too. As a result, even 10 GbE network fabric can be sufficient for achieving outstanding performance in such clusters for both data warehouse and OLTP workloads. For example, a 3-node cluster with four NVMe SSDs per node can provide 30 GB/s of read bandwidth, even on a 10 GbE network.

“With FlashGrid and AWS we can now deploy a new application within two weeks instead of six months, without compromising our availability SLA.”

David Urban, VP Operations, Aria Systems

“FlashGrid SkyCluster provides us all elements needed for running our mission-critical Oracle databases in Azure: storage and networking software, deployment automation, and 24x7 support.”

Jay Wilder, Sr. Director of Software Engineering, Nuance Communications

“Implementing Oracle RAC with FlashGrid in AWS is an important milestone in Digital Virgo’s cloud journey.”

Mikolaj Klimek, Deputy Director IT Operations and Infrastructure, Digital Virgo

“With our new virtualized RAC running on FlashGrid architecture we do a 125GB full table scan in 6 seconds. That is an 11x improvement over our previous setup.”

Sam Shiel, Oracle DBA, Simplyhealth