One gram of DNA now capable of storing over 500Pb of data thanks to synthetic DNA breakthrough

0 3

By Matthew Griffin Computing 20th March 2022

WHY THIS MATTERS IN BRIEF

The rate of information growth is now so vast that in a century the planet could end up being just one giant datacenter, so we need new ways to store huge volumes of data cheaply and easily.

Love the Exponential Future? Join our XPotential Community, future proof yourself with courses from XPotential University, connect, watch a keynote, read our codexes, or browse my blog.

The amount of information that the world is creating has careered way beyond the zettabyte range and it’s accelerating. As a result organisations everywhere are struggling to store all this data let alone analyse it. But, what if you could store all this data in a space the size of a shoebox and then shove it all into the cloud like Microsoft are planning on doing? It’s possible with DNA storage – the next next evolution of hard drive and tape storage that companies today are already building.

As with most things, nature’s data storage system, DNA, far surpasses anything we’ve created and so far researchers have been able to demonstrate that just a single gram of it can hold a whopping 215 Petabytes of data. Now, researchers at the University of Illinois Urbana-Champaign have doubled DNA’s already incredible storage capacity by adding extra letters to its “alphabet” – to create an 11 base pair synthetic DNA alphabet like the ones I’ve discussed before that were used to create “alien” 8 base pair organisms.

DNA is naturally made up of combinations of four nucleobases: adenine, guanine, cytosine and thymine. Represented by the letters A, G, C and T, these bases group together in different sequences to form blueprints for every living organism. And this information storage system is incredibly dense.

That of course makes it a very attractive potential storage solution for the huge amounts of data modern society produces daily. And as if 215 Petabytes per gran wasn’t dense enough, the researchers on the new study have found a way to double it to over 500 petabytes per gram.

Along with the usual A, G, C and T, the team effectively added an extra seven “letters” to the DNA alphabet. These take the form of chemically modified nucleotides, opening up more varied combinations that allow more information to be stored within the same amount of physical space.

“Imagine the English alphabet,” said Kasra Tabatabaei, co-author of the study. “If you only had four letters to use, you could only create so many words. If you had the full alphabet, you could produce limitless word combinations. That’s the same with DNA. Instead of converting zeroes and ones to A, G, C, and T, we can convert zeroes and ones to A, G, C, T, and the seven new letters in the storage alphabet.”

Of course, adding extra nucleotides means that existing systems for reading data back won’t recognize them, so the team also developed a new system that can. The DNA strand passes through a nanopore in a specially designed protein, which can detect the individual units regardless of whether they’re natural or synthetic. Machine learning algorithms then decode the information stored within.

“We tried 77 different combinations of the 11 nucleotides, and our method was able to differentiate each of them perfectly,” said Chao Pan, co-author of the study. “The deep learning framework as part of our method to identify different nucleotides is universal, which enables the generalizability of our approach to many other applications.”

In addition to density, the new method also improves the writing speed of the data, which is normally a fairly sluggish process for DNA. This system roughly halved the amount of time it takes to write information to DNA.

This work could help make DNA a viable data storage system, although there’s still plenty more work left to be done.

The research was published in the journal Nano Letters.

Source: University of Illinois Urbana-Champaign

Matthew Griffin / About Author

Matthew Griffin is a multi-award winning Futurist and expert in Disruption and Innovation, Geopolitics, Leadership, and Technology, who NASA have described as a "walking encyclopaedia of the future" and a "futurist Polymath." 15-time best selling author of the "Codex of the Future" series, Matthew is the Founder and Futurist in Chief of the 311 Institute, a global Futures and Deep Futures advisory firm working with royal households, world leaders, G7, G20, and G77 governments, NGOs, and multi-national mid and mega cap firms to help them explore, shape, and lead the next 50 years of business and society.

An award-winning YouTube creator with over a million followers, with an unrivalled global reach and impact, Matthew is a highly sought-after international keynote speaker, lecturer, and mentor who collaborates with global leaders through the United Nations Alliance of Civilizations (UNAOC) and United Nations General Assembly (UNGA) to shape pivotal initiatives such as the UN’s AI for Humanity program, the United Nations Conference of the Parties (UN COP), and the World Economic Forum in Davos.

As the former Global Head of Cloud, National Security, and Enterprise Sales for companies including Atos, Dell-EMC, and IBM, Matthew has a proven track record of building multi-billion dollar business units and turning failing divisions into market leaders. His ability to identify, analyse, and communicate the implications of hundreds of emerging technologies and trends is unparalleled, and his insights are trusted by many of the world’s most respected organisations, including ABB, Accenture, Adidas, AON, ARM, BCG, Centrica, Citi, Coca-Cola, Dentons, Deloitte, Dow Jones, EY, Google, KPMG, Lego, Legal & General, LinkedIn, Microsoft, PepsiCo, Qualcomm, RWE, Samsung, Siemens AG and Siemens Energy, T-Mobile, UBS, VISA, Walmart, Workday, Worldpay and many others.

Regularly featured in the global media including the AP, BBC, Bloomberg, CNBC, Discovery, Forbes, Khaleej Times, Telegraph, TIME, ViacomCBS, WIRED, and the WSJ, Matthews mission is to help organisations create a fair and sustainable future whose benefits are shared by everyone irrespective of their ability, background, or circumstances.