Information de reference pour ce titreAccession Number: | 00006056-201302070-00045.
|
Author: | Goldman, Nick 1,*; Bertone, Paul 1; Chen, Siyuan 2; Dessimoz, Christophe 1; LeProust, Emily M. 2; Sipos, Botond 1; Birney, Ewan 1
|
Institution: | (1)European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton CB10 1SD, UK (2)Agilent Technologies, Genomics-LSSU, 5301 Stevens Creek Boulevard, Santa Clara, California 95051, USA
|
Title: | Towards practical, high-capacity, low-maintenance information storage in synthesized DNA.[Letter]
|
Source: | Nature. 494(7435):77-80, February 7, 2013.
|
Abstract: | : Digital production, transmission and storage have revolutionized how we access and use information but have also made archiving an increasingly complex task that requires active, continuing maintenance of digital media. This challenge has focused some interest on DNA as an attractive target for information storage 1 because of its capacity for high-density information encoding, longevity under easily achieved conditions 2,3,4 and proven track record as an information bearer. Previous DNA-based information storage approaches have encoded only trivial amounts of information 5,6,7 or were not amenable to scaling-up 8, and used no robust error-correction and lacked examination of their cost-efficiency for large-scale information archival 9. Here we describe a scalable method that can reliably store more information than has been handled before. We encoded computer files totalling 739 kilobytes of hard-disk storage and with an estimated Shannon information 10 of 5.2 x 106 bits into a DNA code, synthesized this DNA, sequenced it and reconstructed the original files with 100% accuracy. Theoretical analysis indicates that our DNA-based storage scheme could be scaled far beyond current global information volumes and offers a realistic technology for large-scale, long-term and infrequently accessed digital archiving. In fact, current trends in technological advances are reducing DNA synthesis costs at a pace that should make our scheme cost-effective for sub-50-year archiving within a decade.
(C) 2013 Nature Publishing Group
|
References: | 1. Baum, E. B. Building an associative memory vastly larger than the brain. Science 268, 583-585 (1995)
2. Cox, J. P. L. Long-term data storage in DNA. Trends Biotechnol. 19, 247-250 (2001)
3. Anchordoquy, T. J. & Molina, M. C. Preservation of DNA. Cell Preserv. Technol. 5, 180-188 (2007)
4. Bonnet, J. et al. Chain and conformation stability of solid-state DNA: implications for room temperature storage. Nucleic Acids Res. 38, 1531-1546 (2010)
5. Clelland, C. T., Risca, V. & Bancroft, C. Hiding messages in DNA microdots. Nature 399, 533-534 (1999)
6. Kac, E. Genesis (1999); available at http://www.ekac.org/geninfo.html- ouverture dans une nouvelle fenêtre (accessed, 10 May 2012)
7. Ailenberg, M. & Rotstein, O. D. An improved Huffman coding method for archiving text, images, and music characters in DNA. Biotechniques 47, 747-754 (2009)
8. Gibson, D. G. et al. Creation of a bacterial cell controlled by a chemically synthesized genome. Science 329, 52-56 (2010)
9. Church, G. M., Gao, Y. & Kosuri, S. Next-generation digital information storage in DNA. Science 337, 1628 (2012)
10. MacKay, D. J. C. Information Theory, Inference, and Learning Algorithms (Cambridge Univ. Press, 2003)
11. Erlich, H. A., Gelfand, D. & Sninsky, J. J. Recent advances in the polymerase chain reaction. Science 252, 1643-1651 (1991)
12. Monaco, A. P. & Larin, Z. YACs, BACs, PACs and MACs: artificial chromosomes as research tools. Trends Biotechnol. 12, 280-286 (1994)
13. Carr, P. A. & Church, G. M. Genome engineering. Nature Biotechnol. 27, 1151-1162 (2009)
14. Willerslev, E. et al. Ancient biomolecules from deep ice cores reveal a forested southern Greenland. Science 317, 111-114 (2007)
15. Green, R. E. et al. A draft sequence of the Neandertal genome. Science 328, 710-722 (2010)
16. Kari, L. & Mahalingam, K. in Algorithms and Theory of Computation Handbook Vol. 2, 2nd edn (eds Atallah, M. J. & Blanton, M.) 31-1-31-24 (Chapman & Hall, 2009)
17. Paun, G., Rozenberg, G. & Salomaa, A. DNA Computing: New Computing Paradigms (Springer, 1998)
18. Watson, J. D. & Crick, F. H. C. Molecular structure of nucleic acids. Nature 171, 737-738 (1953)
19. Niedringhaus, T. P., Milanova, D., Kerby, M. B., Snyder, M. P. & Barron, A. E. Landscape of next-generation sequencing technologies. Anal. Chem. 83, 4327-4341 (2011)
20. LeProust, E. M. et al. Synthesis of high-quality libraries of long (150mer) oligonucleotides by a novel depurination controlled process. Nucleic Acids Res. 38, 2522-2540 (2010)
21. Massingham, T. & Goldman, N. All Your Base: a fast and accurate probabilistic approach to base calling. Genome Biol. 13, R13 (2012)
22. Gantz, J. & Reinsel, D. Extracting Value from Chaos (IDC, 2011)
23. Brand, S. The Clock of the Long Now (Basic Books, 1999)
24. Digital. archiving. History flushed. Economist 403, 56-57 (28 April 2012); available at http://www.economist.com/node/21...- ouverture dans une nouvelle fenêtre (2012)
25. Bessone, N., Cancio, G., Murray, S. & Taurelli, G. Increasing the efficiency of tape-based storage backends. J. Phys. Conf. Ser. 219, 062038 (2010)
26. Baker, M. et al. in Proc. 1st ACM SIGOPS/EuroSys European Conf. on Computer Systems (eds Berbers, Y. & Zwaenepoel, W.) 221-234 (ACM, 2006)
27. Yuille, M. et al. The UK DNA banking network: a "fair access" biobank. Cell Tissue Bank. 11, 241-251 (2010)
28. Global Crop Diversity Trust Svalbard Global Seed Vault. (2012); available at http://www.croptrust.org/main/co...- ouverture dans une nouvelle fenêtre (accessed, 10 May 2012)
|
Language: | English.
|
Document Type: | Letters.
|
Journal Subset: | Life & Biomedical Sciences. Science.
|
ISSN: | 0028-0836
|
NLM Journal Code: | 0410462, nsc
|
DOI Number: | https://dx.doi.org/10.1038/natur...- ouverture dans une nouvelle fenêtre
|
Annotation(s) | |
|
|