PyMOL and Inkscape Bridge the Data and the Data Visualization
2016; Elsevier BV; Volume: 24; Issue: 12 Linguagem: Inglês
10.1016/j.str.2016.11.012
ISSN1878-4186
AutoresShuguang Yuan, H. C. Stephen Chan, Sławomir Filipek, Horst Vogel,
Tópico(s)Genetics, Bioinformatics, and Biomedical Research
ResumoStructural biology has come a long way, from being a niche interest of specialists to where we are today, with over 127,500 biomolecular structures available in the Protein Data Bank (PDB) as of the writing of this letter. The two main factors that have been pushing structural biology forward are the remarkable technical advancements that make solving structures of more complex and larger biomolecules and biomolecular assemblies possible, and the growing appreciation that structure and function are inseparable and intertwined (Ouzounis et al., 2003Ouzounis C.A. Coulson R.M. Enright A.J. Kunin V. Pereira-Leal J.B. Nat. Rev. Genet. 2003; 4: 508-519Crossref PubMed Scopus (90) Google Scholar). In practical terms, it means that we can look deeper into physiology and biological processes if we have structural information about the key biomolecules involved at the ready (Lee et al., 2007Lee D. Redfern O. Orengo C. Nat. Rev. Mol. Cell Biol. 2007; 8: 995-1005Crossref PubMed Scopus (383) Google Scholar, Yuan et al., 2016aYuan S. Filipek S. Vogel H. Structure. 2016; 24: 816-825Abstract Full Text Full Text PDF PubMed Scopus (36) Google Scholar, Yuan et al., 2016bYuan S. Peng Q. Palczewski K. Vogel H. Filipek S. Angew. Chem. Int. Ed. Engl. 2016; 55: 8661-8665Crossref Scopus (21) Google Scholar) and if we know how to use this information efficiently. Several years ago, O’Donoghue et al., 2010O’Donoghue S.I. Goodsell D.S. Frangakis A.S. Jossinet F. Laskowski R.A. Nilges M. Saibil H.R. Schafferhans A. Wade R.C. Westhof E. Olson A.J. Nat. Methods. 2010; 7: S42-S55Crossref Scopus (119) Google Scholar put together a powerful argument for the critical role visualization plays in understanding and using macromolecular structures. At the same time, O’Donoghue et al., 2010O’Donoghue S.I. Goodsell D.S. Frangakis A.S. Jossinet F. Laskowski R.A. Nilges M. Saibil H.R. Schafferhans A. Wade R.C. Westhof E. Olson A.J. Nat. Methods. 2010; 7: S42-S55Crossref Scopus (119) Google Scholar highlighted that existing tools routinely used to visualize structural biology information lack elegance and simplicity and often get in the way of helping those not trained in structural biology to interface and interact with three-dimensional (3D) biomolecular structures in an intuitive and seamless manner. That is not to say that biologists have not been using visualization tools developed over the years (Humphrey et al., 1996Humphrey W. Dalke A. Schulten K. J. Mol. Graph. 1996; 14 (27–28): 33-38Crossref PubMed Scopus (38710) Google Scholar, O’Donoghue et al., 2010O’Donoghue S.I. Goodsell D.S. Frangakis A.S. Jossinet F. Laskowski R.A. Nilges M. Saibil H.R. Schafferhans A. Wade R.C. Westhof E. Olson A.J. Nat. Methods. 2010; 7: S42-S55Crossref Scopus (119) Google Scholar, Pettersen et al., 2004Pettersen E.F. Goddard T.D. Huang C.C. Couch G.S. Greenblatt D.M. Meng E.C. Ferrin T.E. J. Comput. Chem. 2004; 25: 1605-1612Crossref PubMed Scopus (28163) Google Scholar) with success. However, many of these tools have some drawbacks; for example, some of them do not support Linux and/or the MacOS platform, some of them do not support Python programming or they use other programming languages that cannot batch multiple tasks in an easy way, some of them suffer from low rendering speed and efficiency, some of them depend heavily on command line and the interfaces that are not user-friendly, and the list can go on and on. We realize that developing the perfect software that would serve specialists and novices equally well and enable discovery as well as encourage exploration or even play is a difficult task. We also appreciate that there may not be a perfect solution that would tick off all the boxes. Having said that, in this Letter we would like to share our experience working with two visualization tools that to our mind stand apart from the rest as being both easy to use and very versatile. The two tools are PyMOL and Inkscape, and in our hand, using the two together makes a real difference to how easily we can generate visualization for our structural biology information, and how well and how clearly these images represent our data (http://crosstalk.cell.com/blog/easy-and-simple-way-to-visualize-structural-biology-data). PyMOL (http://www.pymol.org) is a user-friendly tool (Schrodinger, 2015Schrodinger L.L.C. The PyMOL Molecular Graphics System, Version 1.8. Shrodinger LLC, 2015Google Scholar) that works across all platforms including Linux, MacOS, and Windows. It has been widely used in visualizing macromolecules (Miller et al., 2015Miller R.L. Thompson A.A. Trapella C. Guerrini R. Malfacini D. Patel N. Han G.W. Cherezov V. Caló G. Katritch V. Stevens R.C. Structure. 2015; 23: 2291-2299Abstract Full Text Full Text PDF Scopus (61) Google Scholar). Most tasks of rendering or representations could be done by clicking the mouse or selecting from the menu. PyMOL is a Python-based tool, and many Python plugins have been developed to enhance the visualization and even molecular modeling. These features facilitate batching multiple jobs simultaneously. Users could also take advantage of Python, the most popular scripting language, to render their system in a more sophisticated way. Moreover, the high quality rendering in PyMOL is supported by parallel processing with multiple CPUs, which dramatically boosts the performance of generating the high quality images. On top of it, users can specify an appropriate amount of memory for the rendering so that PyMOL does not occupy too many hardware resources. Furthermore, PyMOL supports a command line, which is of great help in handling multiple objects. We would like to point out that in addition to being an accurate representation of complex structural biology data, these images should also have a high esthetic value and convey not only the sense of science, space, and functionality, but the sense of elegance. PyMOL and similar tools create accurate and workable images; however, additional work is needed to get the most out of data visualization in structural biology and here vector drawing tools are invaluable. There are many good vector drawing tools available, and we would like to highlight Inkscape (http://www.inkscape.org), a free vector graphics software of professional quality. Inkscape supports all platforms including Linux, MacOS, and Windows. The remarkable attribute of Inkscape is its use of scalable vector graphics (SVG), an open XML-based W3C standard, as the native format (Minatani, 2015Minatani K. Stud. Health Technol. Inform. 2015; 217: 506-511Google Scholar). Inkscape can render primitive vector shapes (such as rectangles, ellipses, polygons, arcs, spirals, stars, and isometric boxes), text, and regions containing raster graphics. It also supports image tracing, enabling the editor to create vector graphics from photos and other raster sources. Inkscape can easily export high resolution images in various formats including cairo PNG (.png), portable document format (.pdf), postscript (.ps), HTML5 canvas (.html), and so on. Command line options for exporting and conversions make Inkscape more efficient in handling a large number of tasks in parallel. In our hands, we found that combining PyMOL and Inkscape allows for easy and efficient presentation of structural biology data in a clear, professional, and artistic way. One can first create each component (such as cartoons, surfaces, ball-and-sticks, meshes, and so on) for illustration in PyMOL, then assemble them in Inkscape by lines, ellipses, curves, texts, and so on. You can see some examples of our images in the blog post accompanying this letter (http://crosstalk.cell.com/blog/easy-and-simple-way-to-visualize-structural-biology-data). In conclusion, PyMOL is a dedicated tool for macromolecular visualization, whereas Inkscape is a vector software of professional quality. Using PyMOL and Inkscape together provides an easy and elegant way of visualizing structural biology information. PyMOL and Inkscape bridge the data and the data visualization. S.Y. initialized the project. All authors wrote the manuscript. S.F. received funding from the National Center of Science, Poland, grant no. 2014/15/B/ST4/04949. Research in H.V.’s group was supported by the European Community (project SynSignal, grant no. FP7-KBBE-2013-613879) and internal funds of the EPFL and the Arnold and Mabel Beckman Foundation. H.V. and S.F. participate in the European COST Action CM1207 (GLISTEN).
Referência(s)