Re: [Seqan-dev] Client-server lambda?

From: Jose Manuel Duarte <jose.duarte@psi.ch>
To: Hannes Hauswedell <hannes.hauswedell@fu-berlin.de>
Date: Mon, 11 May 2015 15:11:29 +0200
Cc: SeqAn Development <seqan-dev@lists.fu-berlin.de>
Reply-to: SeqAn Development <seqan-dev@lists.fu-berlin.de>
Subject: Re: [Seqan-dev] Client-server lambda?

Have you tried storing the database file (including lambda's files) in a
shared memory filesystem, e.g. /dev/shm ? If you do this all data will already
be in main memory when the program is started -- however it will still need to
be copied around, so of course its not optimal. Also during program run-time
the sequences will both be in the program's allocated memory and in the shm,
so they will effectively use double the space. But it might still be
worthwhile for you, I can't say without knowing the exact use-case and
hardware available.


I've tried /dev/shm already but it didn't make a difference.

Here are the runtimes from 3 consecutive runs (269 sequences in one fileagainst a database that takes 7.7GB in plain text fasta file).


Reading from disk:

real    1m49.591s
real    1m49.259s
real    1m49.282s

Reading from /dev/shm:

real    1m49.480s
real    1m49.290s
real    1m49.007s

As you say the data still needs to be copied around and that is mostlikely where most of that time is spent (steps "Loading Subj Sequences"and "Loading Subj Ids" seem to be the slow ones).

My guess is that there's also a lot of disk buffering happening whenit's read from disk (the system I'm running has 128GB of memory and notso loaded at the moment, so I'm sure it has enough memory to keep allthe files in the buffer cache), that's my explanation as to why there'snot much difference between the disk and the /dev/shm runs.


Cheers

Jose

<-- thread -->

<-- date -->

References:
- [Seqan-dev] Client-server lambda?
  - From: Jose Manuel Duarte <jose.duarte@psi.ch>
- Re: [Seqan-dev] Client-server lambda?
  - From: Hannes Hauswedell <hannes.hauswedell@fu-berlin.de>

seqan-dev - May 2015 - Archives indexes sorted by:
[ thread ] [ subject ] [ author ] [ date ]
Complete archive of the seqan-dev mailing list
More info on this list...