Special Feature

User Panel

My Panel

My Panel

Bookmark Science Articles

Recent News
Bookmark / Share This Science Site

SVM-Fold: a tool for discriminative multi-class protein fold and superfamily recognition.

SVM-Fold: a tool for discriminative multi-class protein fold and superfamily recognition. Research Abstract Details 

Research Abstract Table of Contents

Jump to the:

  • Abstract Text of This Paper
  • Journal Published
  • MeSH Keywords of This Abstract
  • Chemicals and Substances Used in this Paper
  • Grants and Granting Agency of this Research
  • Database Accession Numbers Used in this Paper
  • Related Papers
  • Related Research Tags
  • Rate this Research Paper
  • SVM-Fold: a tool for discriminative multi-class protein fold and superfamily recognition. Abstract Text:

    iain melvinIain Melvin,eugene ieEugene Ie,rui kuangRui Kuang,jason westonJason Weston,william noble staffordWilliam Noble Stafford,christina leslieChristina Leslie,

    BACKGROUND: Predicting a protein's structural class from its amino acid sequence is a fundamental problem in computational biology. Much recent work has focused on developing new representations for protein sequences, called string kernels, for use with support vector machine (SVM) classifiers. However, while some of these approaches exhibit state-of-the-art performance at the binary protein classification problem, i.e. discriminating between a particular protein class and all other classes, few of these studies have addressed the real problem of multi-class superfamily or fold recognition. Moreover, there are only limited software tools and systems for SVM-based protein classification available to the bioinformatics community. RESULTS: We present a new multi-class SVM-based protein fold and superfamily recognition system and web server called SVM-Fold, which can be found at http://svm-fold.c2b2.columbia.edu. Our system uses an efficient implementation of a state-of-the-art string kernel for sequence profiles, called the profile kernel, where the underlying feature representation is a histogram of inexact matching k-mer frequencies. We also employ a novel machine learning approach to solve the difficult multi-class problem of classifying a sequence of amino acids into one of many known protein structural classes. Binary one-vs-the-rest SVM classifiers that are trained to recognize individual structural classes yield prediction scores that are not comparable, so that standard "one-vs-all" classification fails to perform well. Moreover, SVMs for classes at different levels of the protein structural hierarchy may make useful predictions, but one-vs-all does not try to combine these multiple predictions. To deal with these problems, our method learns relative weights between one-vs-the-rest classifiers and encodes information about the protein structural hierarchy for multi-class prediction. In large-scale benchmark results based on the SCOP database, our code weighting approach significantly improves on the standard one-vs-all method for both the superfamily and fold prediction in the remote homology setting and on the fold recognition problem. Moreover, our code weight learning algorithm strongly outperforms nearest-neighbor methods based on PSI-BLAST in terms of prediction accuracy on every structure classification problem we consider. CONCLUSION: By combining state-of-the-art SVM kernel methods with a novel multi-class algorithm, the SVM-Fold system delivers efficient and accurate protein fold and superfamily recognition.

    SVM-Fold: a tool for discriminative multi-class protein fold and superfamily recognition. Publishing Authors By Initials

    i melvinI Melvin,e ieE Ie,r kuangR Kuang,j westonJ Weston,wn staffordWN Stafford,c leslieC Leslie,

    For similar information science: computing methodologies: software research abstracts see: information science: computing methodologies: software research

    PUBMED ID PMID:

    MEDLINE DATE:

    SVM-Fold: a tool for discriminative multi-class protein fold and superfamily recognition. Journal Published:

    PUBLICATION TYPE: Research Support, U.S. Gov't,

    Journal: BMC bioinformatics

    VOLUME: 8 Suppl 4

    Page Numbers: S2

    Journal Abbreviation: BMC Bioinformatics

    ISSN: 1471-2105

    DAY: 22

    MONTH: 05

    YEAR: 2007

    SVM-Fold: a tool for discriminative multi-class protein fold and superfamily recognition. Information

    Number of References:

    LANGUAGE: eng

    NlmUniqueID: 100965194

    SVM-Fold: a tool for discriminative multi-class protein fold and superfamily recognition. Keywords Mesh Terms:

    KEYWORDS: Software

    MESH TERMS: methods

    Chemical & Substance for Abstract: SVM-Fold: a tool for discriminative multi-class protein fold and superfamily recognition. Information

    Substance Name: Proteins

    Registry Number: 0

    Grant and Affiliation Information for SVM-Fold: a tool for discriminative multi-class protein fold and superfamily recognition.

    AFFILIATION: NEC Laboratories of America, Princeton, NJ, USA. imelvin@gmail.com

    Country: England

    England Research PublicationEngland Research Publication

    AGENCY: United States NIGMS

    GRANT: GM74257

    ACRONYM: GM

    MEDLINETA: BMC Bioinformatics

    REFSOURCE:

    DATABASENAME:

    ACCESSION NUMBER:

    Number Hits: 0

    SVM-Fold: a tool for discriminative multi-class protein fold and superfamily recognition Related Publications

     

    Molecular Station USER Menu

    Welcome to Molecular Station!

    You have to register before you can post on our forums or use our advanced features. Register Now! Its Free and Fast!

    Already registered? Login now below.

    User Name:

    Password:

    Already registered and Forgot your password? Click below to recover it.

    Recover Lost Password

    Join now - it's fast and free!

    Molecular Station is THE largest network of researchers, scientists and science lovers anywhere!

    Research Terms of Usage and Disclaimer
    Home
    Features

    Protocols

    DNA Forum

    Science Forum

    DNA Forum
    Biology Forum

    Science News


    [CaRP] XML error: Invalid document end at line 2

    For more click here:Science News