Pantheon DNA Data Storage CODEC: Experiences, Challenges, and Innovations

Mon Sep 18 | 2:30pm

Location: Salon IV

Abstract

Synthetic DNA has several well-known advantages over the standard media used for cold-data storage: higher density, lower energy consumption, and greater durability. Bringing this technology to market requires cost-effective DNA synthesizers that can write data at an adequate throughput, and a CODEC able to handle data from different synthesis and sequencing technologies. Over the last two years, the Prometheus project, a partnership between Lenovo and the IPT institute in Brazil, has made significant progress in developing DNA writing machines and a versatile CODEC.

This presentation gives an overview of the DNA data storage pipeline, drawing on real-world experience from data encoding through storage and retrieval. Our primary goal is to share insights and practical knowledge about coding and decoding techniques, with particular emphasis on the error correction architecture we designed. The CODEC includes the standard components of a storage system, such as encoding and decoding algorithms, addressing, and error correction coding. It also applies standard bioinformatics techniques, collectively known as the sanitizing process: removal of low-quality reads, adapter removal, and contaminant filtering, followed by alignment and clustering of the sequenced reads.

The latest released version of the Lenovo DNA Data Storage CODEC, named Pantheon, already applies the Sector scheme proposed by the SNIA DNA Archive Rosetta Stone (DARS) technical working group. Results from experiments with this CODEC will be demonstrated and discussed.

Finally, the presentation conveys the complexity of implementing coding and adaptive decoding techniques for a functional DNA data storage system, covering practical considerations, potential roadblocks, and viable solutions drawn from our real-world experience.
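
As a rough illustration of the sanitizing steps named in the abstract, the sketch below filters low-quality reads, trims an adapter, and groups the surviving reads into clusters. It is not the Pantheon implementation: the Read type, the function names, the thresholds, and the adapter string are all hypothetical, the clustering is a simple greedy Hamming-distance grouping chosen for brevity, and contaminant filtering and alignment are left out.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Read:
    """A sequenced read: bases plus one Phred quality score per base (hypothetical type)."""
    seq: str
    quals: List[int]


def mean_quality(read: Read) -> float:
    """Average Phred score of a read; 0.0 for an empty read."""
    return sum(read.quals) / len(read.quals) if read.quals else 0.0


def trim_adapter(read: Read, adapter: str) -> Read:
    """Cut the read at the first exact occurrence of the adapter, if any."""
    idx = read.seq.find(adapter)
    if idx == -1:
        return read
    return Read(read.seq[:idx], read.quals[:idx])


def sanitize(reads: List[Read], adapter: str, min_mean_q: float = 20.0) -> List[Read]:
    """Drop reads whose mean quality is below the threshold, then trim adapters."""
    kept = []
    for read in reads:
        if mean_quality(read) < min_mean_q:
            continue
        trimmed = trim_adapter(read, adapter)
        if trimmed.seq:  # discard reads that were all adapter
            kept.append(trimmed)
    return kept


def hamming(a: str, b: str) -> int:
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def cluster(reads: List[Read], max_dist: int = 1) -> List[List[str]]:
    """Greedy clustering: a read joins the first cluster whose representative
    (its first member) has the same length and is within max_dist mismatches."""
    clusters: List[List[str]] = []
    for read in reads:
        for members in clusters:
            rep = members[0]
            if len(rep) == len(read.seq) and hamming(rep, read.seq) <= max_dist:
                members.append(read.seq)
                break
        else:
            clusters.append([read.seq])
    return clusters


if __name__ == "__main__":
    adapter = "AGATCGG"  # hypothetical adapter sequence
    reads = [
        Read("ACGTACGTAGATCGG", [30] * 15),  # good read with adapter attached
        Read("ACGTACGA", [30] * 8),          # one substitution from the first
        Read("TTTTGGGG", [30] * 8),          # a different strand
        Read("ACGTACGT", [5] * 8),           # low quality, will be dropped
    ]
    for group in cluster(sanitize(reads, adapter)):
        print(group)
```

A production pipeline would more likely use established bioinformatics tools for these steps and an edit-distance-aware clustering method, since DNA channels also introduce insertions and deletions that Hamming distance does not handle.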

Learning Objectives

Lenovo DNA Data Storage CODEC advances
Tests with the TWG DNA Archive Rosetta Stone (DARS) Sector Scheme
Challenges and new proposals in processing DNA-sequenced reads using state-of-the-art computational methods

---

André da Costa Martins

IPT

Related Sessions

DNA Data & Archival Storage

Update on Standards for Consuming DNA Data Storage Archives

DNA lacks many key attributes found in other traditional storage media types including locality and addressability.

Daniel Chadash

Twist Bioscience

Joel Christner
Dell Inc.

Establishing Endurance and Data Retention Metrics in a DNA Data Storage System

Users of DNA as a digital data storage medium must have confidence that they can reliably recover their stored data, and to understand the competing capabilities and claims of codecs, readers, writ

David Landsman

Western Digital

Bit-to-DNA Writing Machines: a Microfluidic Platform and Future Data Center Operation Overview

Synthetic DNA-based data storage has been on the rise as a candidate for Data Storage due to its longer shelf life and higher data density.

Henrique Reis Wisinewski

Institute for Technological Research

Bruno Marinaro Verona
Institute for Technological Research

Approximate DNA Storage with High Robustness and Density for Images

Deoxyribonucleic Acid (DNA) as a storage medium with high density and long-term preservation properties can satisfy the requirement of archival storage for rapidly increased digital volume.

Bingzhe Li

University of Texas at Dallas

DNAe2c ECC for DNA Data Storage: 10x Improvement over RS Codes

A new error correction code for DNA data storage is presented.

Mario Montana

DNAalgo

Long Term Preservation and Archive Storage

The long-term retention and backup requirements of many organizations continue to grow as their data estate grows.

Shashidhar Joshi

Microsoft

Ceramic Nano Memory – Data Storage for the Yottabyte Era

The demand for data storage continues to grow exponentially with the overall data storage temperature cooling down with most data becoming cold after one month and subsequently infrequently accesse

Christian Pflaum

Cerabyte - Ceramic Data Solutions Holding GmbH

Cerabyte – Permanent Data Storage

The demand for data storage continues to grow exponentially with the overall data storage temperature cooling down with most data becoming cold after one month and subsequently infrequently accesse

Christian Pflaum

Cerabyte - Ceramic Data Solutions Holding GmbH

Inside the Cloud: A Deep Dive into Cold Data Archiving

Cold data holds significant value for regulatory compliance, audits, legal necessities, and disaster recovery, even though it's not frequently accessed.

Vikranth Etikyala

SoFi
