FU Logo
  • Startseite
  • Kontakt
  • Impressum
  • Home
  • Listenauswahl
  • Anleitungen

[Seqan-dev] Annotation and spliced site

thread
date
  • From: Yue Gan <gany.fml@gmail.com>
  • To: seqan-dev@lists.fu-berlin.de
  • Date: Mon, 13 Jun 2016 23:14:25 -0500
  • Reply-to: SeqAn Development <seqan-dev@lists.fu-berlin.de>
  • Subject: [Seqan-dev] Annotation and spliced site

Hi all,

I am trying to use the annotation part of Seqan to solve a question, but I am a little bit confused.

What I want to do is to
1. Read in an annotation file (the format can be GFF, GTF, or BED)
2. Giving two positions (q1, q2) in the genome, decide if (q1, q2) is a known junction.

My Solution:
1. use GffFile in to load the annotation file(GFF, GTF) into a FragmentStore<>
2. extract the exons and create a junction Interval tree using the FragmentStore<>
3. using interval`s 'findIntervals' function giving q1 and q2 to find if there is an interval(junction) exists.

My Question:
1. Is BED better then GFF and GTF in this situation?
2. If BED is better, could anybody give me some examples what seqan data structure I should use to store and do the search? Some pseudocode with seqan data structure and function could be better....
3. If GFF and GTF are better, is my solution correct and efficient? Is there any better way to solve my question? Is there anything I need to pay attention to?

Thank you very much!

Yue Gan
thread
date
  • seqan-dev - June 2016 - Archives indexes sorted by:
    [ thread ] [ subject ] [ author ] [ date ]
  • Complete archive of the seqan-dev mailing list
  • More info on this list...

Hilfe

  • FAQ
  • Dienstbeschreibung
  • ZEDAT Beratung
  • postmaster@lists.fu-berlin.de

Service-Navigation

  • Startseite
  • Listenauswahl

Einrichtung Mailingliste

  • ZEDAT-Portal
  • Mailinglisten Portal