<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
     xmlns:content="http://purl.org/rss/1.0/modules/content/"
     xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
     xmlns:atom="http://www.w3.org/2005/Atom"
     xmlns:dc="http://purl.org/dc/elements/1.1/"
     xmlns:wfw="http://wellformedweb.org/CommentAPI/"
     >
  <channel>
    <title>brain of mat kelcey</title>
    <link>http://matpalm.com/blog</link>
    <description>thoughts from a data scientist wannabe</description>
    <generator>Blogofile</generator>
    <sy:updatePeriod>hourly</sy:updatePeriod>
    <sy:updateFrequency>1</sy:updateFrequency>
    <item>
      <title>semi supervised naive bayes for text classification</title>
      <link>http://matpalm.com/blog/2010/02/14/semi-supervised-naive-bayes-for-text-classification/</link>
      <category><![CDATA[e13]]></category>
      <category><![CDATA[semi supervised]]></category>
      <category><![CDATA[naive bayes]]></category>
      <guid>http://matpalm.com/blog/?p=299</guid>
      <description>semi supervised naive bayes for text classification</description>
      <content:encoded><![CDATA[<p>experiment 13; <a href="http://matpalm.com/semi_supervised_naive_bayes/">a test of semi supervised naive bayes for text classification</a> is complete.</p>
<p>semi supervised algorithms seem to work pretty well and i can see how they are a huge benefit for text classification where you can have an enormous corpus but not enough time to label it all...</p>]]></content:encoded>
    </item>
    <item>
      <title>an intro to semi supervised document classification</title>
      <link>http://matpalm.com/blog/2010/01/31/an-intro-to-semi-supervised-document-classification/</link>
      <category><![CDATA[semi supervised]]></category>
      <category><![CDATA[naive bayes]]></category>
      <category><![CDATA[machine learning]]></category>
      <guid>http://matpalm.com/blog/?p=275</guid>
      <description>an intro to semi supervised document classification</description>
      <content:encoded><![CDATA[<p>here's a great <a href="http://videolectures.net/mlas06_mitchell_sla/">lecture</a> from <a href="http://www.cs.cmu.edu/~tom/">tom mitchell</a> about document classification using a semi supervised version of naive bayes.</p>
<p>semi supervised algorithms only require some of the training examples to be labeled and are able to make use of any unlabelled ones, very common when we have a huge corpus.</p>
<p>i've started an experiment brewing to test this out by porting some <a href="http://matpalm.com/rss.feed/p3/">previous naive bayes work</a> i did to use this semi supervised scheme and will published it when it's done.</p>
<p>cool stuff!!</p>]]></content:encoded>
    </item>
  </channel>
</rss>

