Startup packs all 16GB of Wikipedia onto DNA strands to demonstrate new storage tech

Authored by cnet.com and submitted by mvea
image for Startup packs all 16GB of Wikipedia onto DNA strands to demonstrate new storage tech

Computer storage technology has moved from wires with magnets to hard disks to 3D stacks of memory chips. But the next storage technology might use an approach as old as life on earth: DNA. Startup Catalog announced Friday it's crammed all of the text of Wikipedia's English-language version onto the same genetic molecules our own bodies use.

It accomplished the feat with its first DNA writer, a machine that would fit easily in your house if you first got rid of your refrigerator, oven and some counter space. And although it's not likely to push aside your phone's flash memory chips anytime soon, the company believes it's useful already to some customers who need to archive data.

DNA strands are tiny and tricky to manage, but the biological molecules can store other data than the genes that govern how a cell becomes a pea plant or chimpanzee. Catalog uses prefabricated synthetic DNA strands that are shorter than human DNA, but uses a lot more of them so it can store much more data.

Relying on DNA instead of the latest high-tech miniaturization might sound like a step backward. But DNA is compact, chemically stable -- and given that it's the foundation of the Earth's biology, it's arguably not as likely to become as obsolete as the spinning magnetized platters of hard drives or CDs that are disappearing today the way floppy drives already vanished.

Who's in the market for this kind of storage? Catalog has one partner to announce, the Arch Mission Foundation that's trying to store human knowledge not just on Earth but even elsewhere in the solar system -- like on Elon Musk's Tesla Roadster that SpaceX launched into orbit. Beyond that, Catalog isn't ready to say who other customers might be or if it'll charge for its DNA writing service.

"We have discussions underway with government agencies, major international science projects that generate huge amounts of test data, major firms in oil and gas, media and entertainment, finance, and other industries," the company said in a statement.

Catalog, based in Boston, has its own device to write data that can record 4 megabits per second right in DNA. Optimizations should triple that rate, letting people record 125 gigabytes in a single day -- about as much as a higher-end phone can store.

Conventional DNA sequencing products already for sale in the biotechnology market read the DNA data. "We think this whole new use case for sequencing technology will help [drive] down cost quite a bit," Catalog said, arguing that computing business is a potentially much larger market.

Two MIT graduate students, Chief Executive Hyunjun Park and Chief Technology Innovation Officer Nathaniel Roquet, founded Catalog in 2016.

Catalog uses an addressing system that means customers can use large data sets. And even though DNA stores data in long sequences, Catalog can read information stored anywhere using molecular probes. In other words, it's a form of random-access memory like a hard drive, not sequential access like the spools of magnetic tape you might remember from the heyday of mainframe computers a half century ago.

Although DNA data can be disrupted by cosmic rays, Catalog argues that it's a more stable medium than the alternatives. After all, we've got DNA from animals that went extinct thousands of years ago. How much do you want to bet that USB thumb drive in your desk drawer will be still useful even 25 years from now?

BleedingEdge on June 29th, 2019 at 17:39 UTC »

Imagine receiving a DMCA takedown notice because your parents decided to encode copyrighted material into your DNA.

N4yC4t on June 29th, 2019 at 16:56 UTC »

So when are we going to inject this into our brains?

kuschelbunny on June 29th, 2019 at 16:02 UTC »

All of wikipedia is 16gb?