7:00 – 4:00 VTX
- Started the paper describing the slider interface
- TF-IDF today!
- Read docs from web and PDF
- Calculate the rank
- Create matrix of terms and documents, weighted by occurrence.
- Hmm. What I’m actually looking for is the lowest-occurring terms within a document that occur over the largest number of documents. I’ve used this page as a starting point. After flailing for many hours in java, I wound up walking through the algorithm in Excel and I think I’ve got it. This is the spreadsheet that embodies my delusional thinking ATM.