Seminario 8feb. Processing FASTA sequences with the SEDA software. 31 xaneiro 2018

O xoves 8 de febreiro, ás 10 da mañá,  terá lugar o seminario na Sala de Conferencias de CACTI (Torre CACTI – Piso 0, xunto ao CITEXVI).

Ponente: Dr. Hugo López-Fernández, do Sistemas Informáticos de Nueva Generación (SI4).

Título: Processing FASTA sequences with the SEDA software

Esta charla será en castelán (ou inglés se asisten falantes non hispanos) , e non se requirirá ningunha inscrición.



One of the most important types of data used in biological research is DNA or protein sequence data. They are usually stored in FASTA files, which can contain one or more sequences. Public databases such as GenBank, NCBI or Ensembl provide huge collections of genomes, genome annotations, and so on, in FASTA format. Nevertheless, downloaded files usually must be preprocessed before subsequent analysis depending on each researcher needs. Despite the simplicity of these preprocessing operations (e.g. remove sequences without a minimum number of bases), processing of large batches of FASTA files is a complex task that usually requires advanced bioinformatics skills and the combination of different tools (including the bash command line) to achieve the desired result. In order to allow researchers to easily perform these operations, we are developing the SEDA software application ( presented in this seminar.