How to obtain the nucleotide sequences DNA from gene bank data base
Abstract
A method of extraction nucleotide sequences of human mRNA from Gene Bank flat format of NCBI was proposed. The method is based on using regular expressions PERL language on the basis of Extended Backus–Naur Forms (EBNF). Suggested for work set of regular expressions could be used for the analysis of other genomes' sequences that are presented in Gene Bank flat format (gbk).
Downloads
References
2. Cochrane G. R., Galperin M. Y. The 2010 Nucleic Acids Research Database Issue and online database collection: a community of data resources // Nucl. Acids Res. – 2010.– V.38.– D1-D4; doi:10.1093/nar/gkp1077.
3. Sayers E. W., Barrett T., Benson D. A. [et al.]. Database resources of the National Center for Biotechnology Information // Nucl. Acids Res. – 2010.– V. 38.– D5-D16; doi:10.1093/nar/gkp967.
4. The NCBI C++ Toolkit [Internet] / edit. Vakatov D., Siyan, K., Ostell J. – Bethesda (MD): National Library of Medicine (US), NCBI; 2004.
5. Gilat A. MATLAB: An Introduction with Applications 2nd Edition. – John Wiley & Sons. ISBN 978-0-471-69420-5.– 2004.
6. MATLAB technical documentation [Access] http://www.mathworks.com/access/helpdesk/help/toolbox/bioinfo/ref/seqtool.html
7. http://www.ncbi.nlm.nih.gov/IEB/ToolBox/SDKDOCS/INDEX.HTML
8. Водолазкий В,Семериков В. Энциклопедия PERL. СПб.: Питер. - 2002.-576С.
9. ftp://ftp.ncbi.nih.gov/genbank/genomes/H_sapiens
10. International Human Genome Sequencing Consortium. The NCBI Handbook [Web resource]: Bethesda National Library of Medicine (US), NCBI 2002-2005 / edit. J. McEntyre, J. Ostell . – [Access]: http://www.ncbi.nlm.nih.gov/books/bv.fcgi?rid=handbook
11. The DDBJ/EMBL/GenBank Feature Table: Definition [Web resource]: Bethesda.– 2007 / International Sequence Databank Collaboration.– [Access]: http://www.ncbi.nlm.nih.gov/projects/collab/FT/index.html#7.4
12. NCBI Help Manual.– Bethesda (MD): National Library of Medicine (US), NCBI; 2005-2009.
13. Компиляторы: принципы, технологии и инструментарий / А. В. Ахо, М. С. Лам, Р. Сети, Д. Д. Ульман. – М.: Вильямс. –2008. –2-е издание. –1184 c.
14. Wain H. M., Bruford E. A., Lovering R. C., Lush M. J., Wright M. W. and Povey S. Guidelines for Human Gene Nomenclature//Genomics.-2002.-№79(4). - Р. 464-470.
15. Пат. 43786 Україна, МПК (2007) G 06 F, 17/21. Спосіб перетворення структури послідовностей генів бази даних / Дуплій Д.Р., Калашніков В.В., Чащин Н.А., заявник та патентовласник Київ. Ін-т Молекулярної Біології и Генетики; Держ.реєстр. від 25.08.09.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).