Special Feature

User Panel

My Panel

My Panel

Bookmark Science Articles

Recent News
Bookmark / Share This Science Site

Logistic regression for disease classification using microarray data: model selection in a large p and small n case.

Logistic regression for disease classification using microarray data: model selection in a large p and small n case. Research Abstract Details 

Research Abstract Table of Contents

Jump to the:

  • Abstract Text of This Paper
  • Journal Published
  • MeSH Keywords of This Abstract
  • Chemicals and Substances Used in this Paper
  • Grants and Granting Agency of this Research
  • Database Accession Numbers Used in this Paper
  • Related Papers
  • Related Research Tags
  • Rate this Research Paper
  • Logistic regression for disease classification using microarray data: model selection in a large p and small n case. Abstract Text:

    j g liaoJ G Liao,khew-voon chinKhew-Voon Chin,

    MOTIVATION: Logistic regression is a standard method for building prediction models for a binary outcome and has been extended for disease classification with microarray data by many authors. A feature (gene) selection step, however, must be added to penalized logistic modeling due to a large number of genes and a small number of subjects. Model selection for this two-step approach requires new statistical tools because prediction error estimation ignoring the feature selection step can be severely downward biased. Generic methods such as cross-validation and non-parametric bootstrap can be very ineffective due to the big variability in the prediction error estimate. RESULTS: We propose a parametric bootstrap model for more accurate estimation of the prediction error that is tailored to the microarray data by borrowing from the extensive research in identifying differentially expressed genes, especially the local false discovery rate. The proposed method provides guidance on the two critical issues in model selection: the number of genes to include in the model and the optimal shrinkage for the penalized logistic regression. We show that selecting more than 20 genes usually helps little in further reducing the prediction error. Application to Golub's leukemia data and our own cervical cancer data leads to highly accurate prediction models. AVAILABILITY: R library GeneLogit at http://geocities.com/jg_liao

    Logistic regression for disease classification using microarray data: model selection in a large p and small n case. Publishing Authors By Initials

    jg liaoJG Liao,kv chinKV Chin,

    For similar biological factors: biological markers: tumor markers, biological research abstracts see: biological factors: biological markers: tumor markers, biological research

    PUBMED ID PMID:

    MEDLINE DATE:

    Logistic regression for disease classification using microarray data: model selection in a large p and small n case. Journal Published:

    PUBLICATION TYPE: Research Support, N.I.H., Extr

    Journal: Bioinformatics (Oxford, England)

    VOLUME: 23

    Page Numbers: 1945-51

    Journal Abbreviation: Bioinformatics

    ISSN: 1460-2059

    DAY: 31

    MONTH: 05

    YEAR: 2007

    Logistic regression for disease classification using microarray data: model selection in a large p and small n case. Information

    Number of References:

    LANGUAGE: eng

    NlmUniqueID: 9808944

    Logistic regression for disease classification using microarray data: model selection in a large p and small n case. Keywords Mesh Terms:

    KEYWORDS: Tumor Markers, Biological

    MESH TERMS: analysis

    Chemical & Substance for Abstract: Logistic regression for disease classification using microarray data: model selection in a large p and small n case. Information

    Substance Name: Tumor Markers, Biological

    Registry Number: 0

    Grant and Affiliation Information for Logistic regression for disease classification using microarray data: model selection in a large p and small n case.

    AFFILIATION: Drexel University School of Public Health, Philadelphia, PA 19102, USA. jl544@drexel.edu

    Country: England

    England Research PublicationEngland Research Publication

    AGENCY: United States NCI

    GRANT: 2P30 CA 72720-04

    ACRONYM: CA

    MEDLINETA: Bioinformatics

    REFSOURCE:

    DATABASENAME:

    ACCESSION NUMBER:

    Number Hits: 0

    Logistic regression for disease classification using microarray data: model selection in a large p and small n case Related Publications

     

    Molecular Station USER Menu

    Welcome to Molecular Station!

    You have to register before you can post on our forums or use our advanced features. Register Now! Its Free and Fast!

    Already registered? Login now below.

    User Name:

    Password:

    Already registered and Forgot your password? Click below to recover it.

    Recover Lost Password

    Join now - it's fast and free!

    Molecular Station is THE largest network of researchers, scientists and science lovers anywhere!

    Research Terms of Usage and Disclaimer
    Home
    Features

    Protocols

    DNA Forum

    Science Forum

    DNA Forum
    Biology Forum

    Science News


    [CaRP] XML error: Invalid document end at line 2

    For more click here:Science News