Special Feature

User Panel

My Panel

My Panel

Bookmark Science Articles

Recent News
Bookmark / Share This Science Site

Total ancestry measure: quantifying the similarity in tree-like classification, with genomic applications.

Total ancestry measure: quantifying the similarity in tree-like classification, with genomic applications. Research Abstract Details 

Research Abstract Table of Contents

Jump to the:

  • Abstract Text of This Paper
  • Journal Published
  • MeSH Keywords of This Abstract
  • Chemicals and Substances Used in this Paper
  • Grants and Granting Agency of this Research
  • Database Accession Numbers Used in this Paper
  • Related Papers
  • Related Research Tags
  • Rate this Research Paper
  • Total ancestry measure: quantifying the similarity in tree-like classification, with genomic applications. Abstract Text:

    haiyuan yuHaiyuan Yu,ronald jansenRonald Jansen,gustavo stolovitzkyGustavo Stolovitzky,mark gersteinMark Gerstein,

    MOTIVATION: Many classifications of protein function such as Gene Ontology (GO) are organized in directed acyclic graph (DAG) structures. In these classifications, the proteins are terminal leaf nodes; the categories 'above' them are functional annotations at various levels of specialization and the computation of a numerical measure of relatedness between two arbitrary proteins is an important proteomics problem. Moreover, analogous problems are important in other contexts in large-scale information organization--e.g. the Wikipedia online encyclopedia and the Yahoo and DMOZ web page classification schemes. RESULTS: Here we develop a simple probabilistic approach for computing this relatedness quantity, which we call the total ancestry method. Our measure is based on counting the number of leaf nodes that share exactly the same set of 'higher up' category nodes in comparison to the total number of classified pairs (i.e. the chance for the same total ancestry). We show such a measure is associated with a power-law distribution, allowing for the quick assessment of the statistical significance of shared functional annotations. We formally compare it with other quantitative functional similarity measures (such as, shortest path within a DAG, lowest common ancestor shared and Azuaje's information-theoretic similarity) and provide concrete metrics to assess differences. Finally, we provide a practical implementation for our total ancestry measure for GO and the MIPS functional catalog and give two applications of it in specific functional genomics contexts. AVAILABILITY: The implementations and results are available through our supplementary website at: http://gersteinlab.org/proj/funcsim. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

    Total ancestry measure: quantifying the similarity in tree-like classification, with genomic applications. Publishing Authors By Initials

    h yuH Yu,r jansenR Jansen,g stolovitzkyG Stolovitzky,m gersteinM Gerstein,

    For similar genetic phenomena: variation (genetics) research abstracts see: genetic phenomena: variation (genetics) research

    PUBMED ID PMID:

    MEDLINE DATE:

    Total ancestry measure: quantifying the similarity in tree-like classification, with genomic applications. Journal Published:

    PUBLICATION TYPE: Research Support, N.I.H., Extr

    Journal: Bioinformatics (Oxford, England)

    VOLUME: 23

    Page Numbers: 2163-73

    Journal Abbreviation: Bioinformatics

    ISSN: 1460-2059

    DAY: 31

    MONTH: 05

    YEAR: 2007

    Total ancestry measure: quantifying the similarity in tree-like classification, with genomic applications. Information

    Number of References:

    LANGUAGE: eng

    NlmUniqueID: 9808944

    Total ancestry measure: quantifying the similarity in tree-like classification, with genomic applications. Keywords Mesh Terms:

    KEYWORDS: Variation (Genetics)

    MESH TERMS: genetics

    Chemical & Substance for Abstract: Total ancestry measure: quantifying the similarity in tree-like classification, with genomic applications. Information

    Substance Name: Proteins

    Registry Number: 0

    Grant and Affiliation Information for Total ancestry measure: quantifying the similarity in tree-like classification, with genomic applications.

    AFFILIATION: Department of Molecular Biophysics & Biochemistry, Yale University, PO Box 208114, New Haven, CT 06520, USA.

    Country: England

    England Research PublicationEngland Research Publication

    AGENCY: United States NHGRI

    GRANT: P50 HG02357-01

    ACRONYM: HG

    MEDLINETA: Bioinformatics

    REFSOURCE:

    DATABASENAME:

    ACCESSION NUMBER:

    Number Hits: 0

    Total ancestry measure: quantifying the similarity in tree-like classification, with genomic applications Related Publications

     

    Molecular Station USER Menu

    Welcome to Molecular Station!

    You have to register before you can post on our forums or use our advanced features. Register Now! Its Free and Fast!

    Already registered? Login now below.

    User Name:

    Password:

    Already registered and Forgot your password? Click below to recover it.

    Recover Lost Password

    Join now - it's fast and free!

    Molecular Station is THE largest network of researchers, scientists and science lovers anywhere!

    Research Terms of Usage and Disclaimer
    Home
    Features

    Protocols

    DNA Forum

    Science Forum

    DNA Forum
    Biology Forum

    Science News


    [CaRP] XML error: Invalid document end at line 2

    For more click here:Science News