Abstract on DNA digital data storage
Digital data has changed the use and access of information. Everyday lot of data is produced and this requires high-density storage devices which can retain values for a long time. Deoxyribonucleic acid (DNA) can be potentially used for these purposes as it is not much different from the conventional method used in a computer. DNA can be used as a robust and high-density storage device even under unfavourable conditions. Theoretically, one can encode 2 bits per nucleotide in DNA which can store 455 exabytes per gram maximum data in single-stranded DNA (ssDNA). In this paper, the method described can be used to store text data in DNA by compressing, storing multiple copies along with providing security to data.
The demand for data storage devices is increasing day by day as more and more data is generated every day. Total information in digital format in the year 2012 was about 2.7 zettabytes. Presently devices such as optical discs, portable hard drives, and flash drives are used to store data. But silicon and the other non-biodegradable materials used in data storage pollute the environment. Also, they are available in limited quantities. Thus, they would be exhausted one day. The linear density of digital storage device is 10 kb per square mm. Hence, newer technology is needed for data storage and archival process. As the data increases, the current data storage technology would not be enough to store data in future as data is growing every day. Even potentially important information can get lost due lack of storage space.
How does DNA digital data storage work?
The digital data is encoded in a DNA sequence, the corresponding sequence information is synthesized into an artificial DNA and the information is decoded by sequencing the artificial DNA strand. This is the exact path of storing and retrieving digital data from DNA.
Encoding data into the DNA sequence:
The computer is worked on a binary system of 1 and 2. In the very first step, digital data is incorporated into the DNA. The DNA has 4 nitrogenous bases: Adenine (A), Cytosine (C), Guanine (G) and Thymine (T). For storing data into the DNA, the A, T, G and C bases of DNA first converted into binary codes 1 and 0.
00 for A, 01 for G, 10 for C and 11 for T are the binary codes for storing information. The information in the binary form is converted into the sequence of A, T, G, C. Now we have the long digital sequence of DNA.
References:
https://www.researchgate.net/publication/303318968_Digital_Data_Storage_on_DNA
https://geneticeducation.co.in/dna-digital-data-storage/
https://en.wikipedia.org/wiki/DNA_digital_data_storage