[Seqan-dev] Disk-based index

From: John Reid <j.reid@mail.cryst.bbk.ac.uk>
To: SeqAn Development <seqan-dev@lists.fu-berlin.de>
Date: Wed, 28 Aug 2013 09:54:30 +0100
Reply-to: SeqAn Development <seqan-dev@lists.fu-berlin.de>
Subject: [Seqan-dev] Disk-based index

Hi all,

I would like to index the mouse or human genome with an enhanced suffix array (ESA). Construction takes several hours and I need the index more than once, so I would like to build it once, store it on disk, and reload it in later runs. Is this feasible, and does SeqAn already support it?
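To make the question concrete, here is a minimal sketch of the build-once, reload-later pattern. It is only an illustration under stated assumptions: it uses a naive suffix array and plain binary file I/O from the C++ standard library, not SeqAn's index classes, and the names buildSuffixArray, saveSuffixArray and loadSuffixArray are hypothetical.

```cpp
#include <algorithm>
#include <cstdint>
#include <fstream>
#include <numeric>
#include <stdexcept>
#include <string>
#include <vector>

// Hypothetical helper: naive suffix array construction by sorting suffix
// start positions. It stands in for the expensive construction step and
// is far too slow for a genome; it is NOT how SeqAn builds an ESA.
std::vector<std::uint64_t> buildSuffixArray(const std::string& text) {
    std::vector<std::uint64_t> sa(text.size());
    std::iota(sa.begin(), sa.end(), 0);
    std::sort(sa.begin(), sa.end(), [&](std::uint64_t a, std::uint64_t b) {
        // Lexicographic comparison of the suffixes starting at a and b.
        return text.compare(a, std::string::npos, text, b, std::string::npos) < 0;
    });
    return sa;
}

// Hypothetical helper: write the entry count followed by the raw entries.
// The file layout is an assumption of this sketch (native byte order).
void saveSuffixArray(const std::vector<std::uint64_t>& sa, const std::string& path) {
    std::ofstream out(path, std::ios::binary);
    if (!out) throw std::runtime_error("cannot open " + path + " for writing");
    const std::uint64_t n = sa.size();
    out.write(reinterpret_cast<const char*>(&n), sizeof n);
    out.write(reinterpret_cast<const char*>(sa.data()),
              static_cast<std::streamsize>(n * sizeof(std::uint64_t)));
    if (!out) throw std::runtime_error("write to " + path + " failed");
}

// Hypothetical helper: read back a file written by saveSuffixArray.
std::vector<std::uint64_t> loadSuffixArray(const std::string& path) {
    std::ifstream in(path, std::ios::binary);
    if (!in) throw std::runtime_error("cannot open " + path + " for reading");
    std::uint64_t n = 0;
    in.read(reinterpret_cast<char*>(&n), sizeof n);
    if (!in) throw std::runtime_error("cannot read entry count from " + path);
    std::vector<std::uint64_t> sa(n);
    in.read(reinterpret_cast<char*>(sa.data()),
            static_cast<std::streamsize>(n * sizeof(std::uint64_t)));
    if (!in) throw std::runtime_error("file " + path + " is truncated");
    return sa;
}
```

The first run would call buildSuffixArray and saveSuffixArray; later runs would call only loadSuffixArray. For a genome-sized index the table would more likely be memory-mapped than read into RAM in one go.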

I am also interested in parallel construction. To quote Wikipedia (http://en.wikipedia.org/wiki/Suffix_tree#External_construction):

"ERA is a recent parallel suffix tree construction method that is significantly faster. ERA can index the entire human genome in 19 minutes on an 8-core desktop computer with 16GB RAM. On a simple Linux cluster with 16 nodes (4GB RAM per node), ERA can index the entire human genome in less than 9 minutes."

Are there any plans to incorporate the ERA algorithm (http://www.vldb.org/pvldb/vol5/p049_essammansour_vldb2012.pdf) into SeqAn?

Thanks,
John.

Follow-Ups:
- Re: [Seqan-dev] Disk-based index
  - From: "Siragusa, Enrico" <Enrico.Siragusa@fu-berlin.de>
