PAC learning with irrelevant attributes

Aditi Dhagat, Lisa Hellerstein

    Research output: Contribution to journalConference articlepeer-review

    Abstract

    We consider the problem of learning in the presence of irrelevant attributes in Valiant's PAC model [V84]-In the PAC model, the goal of the learner is to produce an approximately correct hypothesis from random sample data. If the number of relevant attributes in the target function is small, it may be desirable to produce a hypothesis that also depends on only a small number of variables. Haussler [H88] previously considered the problem of learning monomials of a small number of variables. He showed that the greedy set cover approximation algorithm can be used as a polynomial-time Occam algorithm for learning monomials on r of n variables. It outputs a monomial on r(lnq + 1) variables, where q is the number of negative examples in the sample. We extend this result by showing that there is a polynomial-time Occam algorithm for learning k-term DNF formulas depending on r of n variables that outputs a DNF formula depending on 0(rk logk q) variables, where q is the number of negative examples in the sample. We also give a polynomial-time Occam algorithm for learning decision lists (sometimes called 1-decision lists) with k alternations. It outputs a decision list with k alternations depending on 0(rk logk m) variables, where m is the size of the sample. Using recent non-approximability techniques, Hancock, Jiang, Li, and Tromp [HJLT94] have shown that, unless NP C DTIME[2Poly(logn)], decision lists with k alternations cannot be approximated within a multiplicative factor o/log n and decision lists with an unbounded number of alternations cannot be approximated in polynomial time within a multiplicative factor of 2 l o g γ n for any γ < 1.

    Original languageEnglish (US)
    Pages (from-to)64-74
    Number of pages11
    JournalProceedings - Annual IEEE Symposium on Foundations of Computer Science, FOCS
    DOIs
    StatePublished - 1994
    EventProceedings of the 35th IEEE Annual Symposium on Foundations of Computer Science - Santa Fe, NM, USA
    Duration: Nov 20 1994Nov 22 1994

    ASJC Scopus subject areas

    • General Computer Science

    Fingerprint

    Dive into the research topics of 'PAC learning with irrelevant attributes'. Together they form a unique fingerprint.

    Cite this