Standards-Based Parallel Global File Systems and Automated Data Orchestration with NFS

Wed Sep 20 | 1:30pm

Location:

Salon VI, Salon VII

Abstract

High-performance computing applications, web-scale storage systems, and modern enterprises increasingly need a data architecture that unifies data at the edge, in data centers, and in clouds. Organizations with massive-scale data requirements need the performance of a parallel file system in a standards-based solution that is easy to deploy on machines with diverse security and build environments.

Standards-Based Parallel Global File System - No Proprietary Clients

The Linux community, with contributions from Hammerspace, has developed an embedded parallel file system client as part of the NFS protocol. With NFS 4.2, standard Linux clients can now read and write directly to the storage and scale out performance linearly for both IOPS and throughput, saturating both the storage and the network infrastructure. Proprietary client software is no longer needed to create a high-performance parallel file system, because NFS is an open standard and its client is included in Linux distributions. NFS 4.2 is a commercially driven follow-on to pNFS concepts.
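As a small illustration of the point that no proprietary client is involved, the following sketch checks which mounts on a Linux client negotiated NFS 4.2. It relies only on the fact that Linux lists NFS mounts in its mount table with a `vers=` option; the function name `nfs42_mounts` and the sample servers and paths are hypothetical and chosen for illustration only.

```python
def nfs42_mounts(mount_table: str) -> list[str]:
    """Return the mount points whose options include vers=4.2.

    `mount_table` is text in the format of /proc/mounts:
    device, mount point, filesystem type, options, dump, pass.
    """
    found = []
    for line in mount_table.splitlines():
        fields = line.split()
        if len(fields) < 4:
            continue  # skip blank or malformed lines
        _device, mount_point, fs_type, options = fields[:4]
        # Linux reports NFS mounts as type "nfs" or "nfs4".
        if fs_type in ("nfs", "nfs4") and "vers=4.2" in options.split(","):
            found.append(mount_point)
    return found


# Hypothetical sample data; server names and paths are made up.
SAMPLE = """\
server1:/export /mnt/data nfs4 rw,relatime,vers=4.2,rsize=1048576,proto=tcp 0 0
server2:/home /mnt/home nfs rw,relatime,vers=3,proto=tcp 0 0
/dev/sda1 / ext4 rw,relatime 0 0
"""

if __name__ == "__main__":
    print(nfs42_mounts(SAMPLE))
```

On a real Linux client, the same function could be called on the contents of `/proc/mounts`, for example `nfs42_mounts(open("/proc/mounts").read())`.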

Today’s data architectures span multiple types of storage systems at the edge, in the data center, and in the cloud. With the rise of data orchestration systems that place data on the appropriate storage, in the optimal geographic location, NFS 4.2 is a must-have technology for delivering high-performance workflows on distributed data sets.

Automated Data Orchestration - Across Any Storage System

Hammerspace developed and contributed the Flexible Files technology, which gives applications and users uninterrupted access to data while data movement is orchestrated, even on live files, across incompatible storage tiers from different vendors and across multiple geographic locations.

Flexible Files, along with mirroring, built-in real-time performance telemetry, and attribute delegation (to name a few), are put to work in a global data environment to non-disruptively recall layouts, which enables live data access and data integrity to be maintained, even as files are moved or copied. This has enormous ramifications for enterprises as it can eliminate the downtime traditionally associated with data migrations and technology upgrades. Enterprises can combine this capability with software, such as a metadata engine, that can virtualize data across heterogeneous storage types, and automate the movement and placement of data according to IT-defined business objectives.

Building a Global Data and Storage Architecture

Hammerspace brings NFSv4.2 connectivity (in addition to SMB and NFSv3) to its parallel global file system to build a standards-based, high-performance file system that spans multiple existing, otherwise incompatible storage systems from any vendor, as well as decentralized locations. In this way it can intelligently and efficiently automate the orchestration of data to the applications, compute clusters, or users that need it, enabling global access for analysis, distributed workloads, or AI-driven insights.

Learning Objectives

Learn how to unify data created in different clusters and locations into a single namespace, and place it local to applications and compute for processing and AI
Learn about the latest technologies available to deliver parallel file system performance from data sets stored in a hybrid cloud environment
Learn about the latest in standards-based technologies available for data orchestration and storage at scale

---

David Flynn

Hammerspace

Douglas Fallstrom

Hammerspace

Floyd Christofferson

Hammerspace

Related Sessions

SMB & NFS

Reparse Points Current Status

To implement SMB2 Unix extensions, smbd needs to implement NTFS reparse points to present symlinks, sockets and other special files to clients.

Volker Lendecke

SerNet GmbH

SMB & NFS

Samba io_uring Status Update

With the increasing amount of network throughput, we'll reach a point where data copies are too much for a single CPU core to handle.

Stefan Metzmacher

SerNet/Samba-Team

SMB & NFS

net use //samba/cloud: Scaling Samba

Current clustered Samba uses its homegrown distributed database "ctdb" as a storage backend for maintaining coherent fileserver state.

Ralph Böhme

Samba Team / SerNet

SMB & NFS

Windows Protocol Test Suites: Architecture, Design, and Usage for Testing Protocol Implementations

The Windows Protocol Test Suites is an open source, cross platform application designed to enable the testing of implementations of selected Windows protocols.

Obaro Ogbo

Microsoft

SMB & NFS

Advancing Access to Remote Files: Exploring Recent Enhancements to the Linux SMB3.1.1 Client

The Linux SMB3.1.1 client continues to be one of the most active filesystems in Linux with many improvements added each year, enhancing its ability to securely, reliably and efficiently access remo

Steven French

Microsoft
