开发者

Getting top 10 sequences of BLAST results Bio Python

开发者 https://www.devze.com 2023-03-17 02:04 出处:网络
I want to get top 10 sequences of BLAST results (just the sequences, no alignment or score or e-value etc). I am inputting a text file containing 5 fasta file. So my output should be top 10 blast hits

I want to get top 10 sequences of BLAST results (just the sequences, no alignment or score or e-value etc). I am inputting a text file containing 5 fasta file. So my output should be top 10 blast hits of each fasta file.. therefore my output file will have 50 sequences.

I am reading each of my input fasta file through Bio.SeqIO, writing it as temp.faa and then passing it to command 开发者_运维问答line BLAST through subprocess as

blastp -db nr -query temp.faa -out out.faa -evalue 0.001 -gapopen 11 -gapextend 1 -matrix BLOSUM62 -remote -outfmt 2

the output has lots of other information. Should I parse this output now or there's a better way.

Thanks

P.S XML might be a way but I didn't find a relavant NCBIXML parser syntax.


Solutions on BioStar StackExchange:

http://biostar.stackexchange.com/questions/9880/getting-top-10-sequences-of-blast-results-bio-python

http://biostar.stackexchange.com/questions/9882/parsing-blast-output-biopython-error

0

精彩评论

暂无评论...
验证码 换一张
取 消

关注公众号