On 03/09/2012 08:18 AM, Bernd Jagla wrote:
Hi, I have one comment: the link in the web page http://trac.seqan.de/wiki/Tutorial/FileIO2 that should link to "Double-Pass Record Reader <http://docs.seqan.de/seqan/dev/SPEC_Double-_Pass+_Record+_Reader.html>:" is not workin...
Good catch, I fixed this.
and I have one question: is it possible to write out to a file the things done in the first pass of the "double pass record reader"? I would like to write a function/program that is only working on a given partition. If I repeat this n times I think it would be beneficial to put the first pass of the reading into an external program...
Hm, you could try to use MultiSeqFile, following this tutorial here: http://trac.seqan.de/wiki/HowTo/EfficientImportOfMillionsOfSequencesThis gives you random access, albeit without storing an index into your reads file. Currently, storing such indices is not possible.
However: I can see how this would be useful for processing chunks of data. Let me see whether we can cook up something reuseable in this direction. There is the FAI index format from samtools, but that's only for FASTA files.
*m