Go Back   Molecular Biology Forum > Molecular Research Topics Forum > Bioinformatics
Register Blogs FAQ Members List Calendar Science Groups New! Arcade Search Today's Posts Mark Forums Read

Bioinformatics Have questions about bioinformatic tools or databases? Post questions here. Discuss and post interesting bioinformatics information.


Funny BLAST Results on Drosophila

Bioinformatics

Have questions about bioinformatic tools or databases? Post questions here. Discuss and post interesting bioinformatics information.



Register Molecular Biology Forums
Reply
 
LinkBack Thread Tools Display Modes
  #1 (permalink)  
Old 12-15-2007, 02:57 AM
Pipette Filler
Points: 242, Level: 4Points: 242, Level: 4Points: 242, Level: 4
Activity: 0%Activity: 0%Activity: 0%
 

Join Date: Dec 2007
Posts: 1
maymif RSS Feed
Default Funny BLAST Results on Drosophila

Here's my problem: I'm trying to find sequences of nucleotides in the 3' UTRs of the Drosophila melanogaster genome that can be used to uniquely identify genes. My approach was to run a portion of the gene in question through BLAST (to find matches elsewhere in the genome). I then ran the BLAST results through a simple Perl script I wrote to develop a histogram of the frequency of matches for a given nucleotide. The first figure shows the results I got for the 5-HT7 gene (short name).

Based on these results, I figured that any sequence within the region of zero frequency would be unique in the genome. I then took the first 100 nucleotides of the original sample and ran then through BLAST again. I got the histogram in the second figure.

Clearly, there were a few matches (14 to be exact) that did not show up on the first run through BLAST. This was irksome, but not really problematic. Just for grins and giggles, I then took only the first 23 nucleotides and ran them once more through BLAST. The results are shown in the third figure. This is problematic.

Any idea why BLAST shows different results depending on the length of the sequence I use? Thanks.
Attached Images
File Type: jpg figure1_1to742.jpg (13.0 KB, 3 views)
File Type: jpg figure2_1to100.jpg (15.0 KB, 2 views)
File Type: jpg figure3_1to23.jpg (16.2 KB, 3 views)
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit!
Reply With Quote
Alt Today
Advertising
Google Adsense
 
This advertising will not be shown
in this way to registered members.
Register your free account today
and become a member on
Molecular Biology Forum
Standard Sponsored Links

  #2 (permalink)  
Old 12-17-2007, 03:12 PM
Graduate Student
Points: 1,630, Level: 24Points: 1,630, Level: 24Points: 1,630, Level: 24
Activity: 9%Activity: 9%Activity: 9%
 

Join Date: Aug 2006
Location: Ithaca, NY
Posts: 122
kmunson779 RSS Feed
Default Re: Funny BLAST Results on Drosophila

I'm not exactly sure what you're doing -- for B, for example, you took 100 nucleotides. Did you run BLAST once on those 100 nucleotides & collate your histogram from multiple hits you got on that sequence, or did you take subsets of the sequence and run BLAST multiple times?

The shorter the string you send to BLAST, the more statistically likely it is that it will match the genome sequence.

You can increase the stringency of the BLAST search by playing with which algorithm you use, and the expect threshold, word size, and scoring parameters.
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!Spurl this Post!Reddit!
Reply With Quote
Reply

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On
Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
Domain/motif and double BLAST hits? stewdew Bioinformatics 0 04-18-2008 07:58 PM


All times are GMT. The time now is 07:23 AM.


Powered by vBulletin® Version 3.7.1
Copyright ©2000 - 2008, Jelsoft Enterprises Ltd.
Copyright 2005-2007 Molecular Station | All Rights Reserved