Special Feature

User Panel

My Panel

My Panel

Bookmark Science Articles

Recent News
Bookmark / Share This Science Site

Mathematical correction for fingerprint similarity measures to improve chemical retrieval.

Mathematical correction for fingerprint similarity measures to improve chemical retrieval. Research Abstract Details 

Research Abstract Table of Contents

Jump to the:

  • Abstract Text of This Paper
  • Journal Published
  • MeSH Keywords of This Abstract
  • Chemicals and Substances Used in this Paper
  • Grants and Granting Agency of this Research
  • Database Accession Numbers Used in this Paper
  • Related Papers
  • Related Research Tags
  • Rate this Research Paper
  • Mathematical correction for fingerprint similarity measures to improve chemical retrieval. Abstract Text:

    s joshua swamidassS Joshua Swamidass,pierre baldiPierre Baldi,

    In many modern chemoinformatics systems, molecules are represented by long binary fingerprint vectors recording the presence or absence of particular features or substructures, such as labeled paths or trees, in the molecular graphs. These long fingerprints are often compressed to much shorter fingerprints using a simple modulo operation. As the length of the fingerprints decreases, their typical density and overlap tend to increase, and so does any similarity measure based on overlap, such as the widely used Tanimoto similarity. Here we show that this correlation between shorter fingerprints and higher similarity can be thought of as a systematic error introduced by the fingerprint folding algorithm and that this systematic error can be corrected mathematically. More precisely, given two molecules and their compressed fingerprints of a given length, we show how a better estimate of their uncompressed overlap, hence of their similarity, can be derived to correct for this bias. We show how the correction can be implemented not only for the Tanimoto measure but also for all other commonly used measures. Experiments on various data sets and fingerprint sizes demonstrate how, with a negligible computational overhead, the correction noticeably improves the sensitivity and specificity of chemical retrieval.

    Mathematical correction for fingerprint similarity measures to improve chemical retrieval. Publishing Authors By Initials

    sj swamidassSJ Swamidass,p baldiP Baldi,

    For similar natural sciences: mathematics research abstracts see: natural sciences: mathematics research

    PUBMED ID PMID:

    MEDLINE DATE: 2007 May-Jun

    Mathematical correction for fingerprint similarity measures to improve chemical retrieval. Journal Published:

    PUBLICATION TYPE: Research Support, U.S. Gov't,

    Journal: Journal of chemical information and modeling

    VOLUME: 47

    Page Numbers: 952-64

    Journal Abbreviation:

    ISSN: 1549-9596

    DAY: 20

    MONTH: 04

    YEAR: 2007

    Mathematical correction for fingerprint similarity measures to improve chemical retrieval. Information

    Number of References:

    LANGUAGE: eng

    NlmUniqueID: 101230060

    Mathematical correction for fingerprint similarity measures to improve chemical retrieval. Keywords Mesh Terms:

    KEYWORDS: Mathematics

    MESH TERMS: methods

    Chemical & Substance for Abstract: Mathematical correction for fingerprint similarity measures to improve chemical retrieval. Information

    Substance Name:

    Registry Number:

    Grant and Affiliation Information for Mathematical correction for fingerprint similarity measures to improve chemical retrieval.

    AFFILIATION: Institute for Genomics and Bioinformatics, School of Information and Computer Sciences, University of California, Irvine, Irvine, California 92697-3435, USA.

    Country: United States

    United States Research PublicationUnited States Research Publication

    AGENCY: United States NLM

    GRANT: LM-07443-01

    ACRONYM: LM

    MEDLINETA: J Chem Inf Model

    REFSOURCE:

    DATABASENAME:

    ACCESSION NUMBER:

    Number Hits: 0

    Mathematical correction for fingerprint similarity measures to improve chemical retrieval Related Publications

     

    Molecular Station USER Menu

    Welcome to Molecular Station!

    You have to register before you can post on our forums or use our advanced features. Register Now! Its Free and Fast!

    Already registered? Login now below.

    User Name:

    Password:

    Already registered and Forgot your password? Click below to recover it.

    Recover Lost Password

    Join now - it's fast and free!

    Molecular Station is THE largest network of researchers, scientists and science lovers anywhere!

    Research Terms of Usage and Disclaimer
    Home
    Features

    Protocols

    DNA Forum

    Science Forum

    DNA Forum
    Biology Forum

    Science News


    [CaRP] XML error: Invalid document end at line 2

    For more click here:Science News