Schedule

LING052, Spring 2014

The notes for each class meeting will include the readings and other assigned background activities, such as exploring on-line resources or installing interactive software. Those notes will be posted long enough in advance to give you time to do the readings and other activities before class.

Assignments will be posted on the line corresponding to the date that they're due. Keep in mind that "assignments" here refers only to things that you're expected to turn in; the notes for each meeting include other assigned activities that you should do before class.

Week Date Notes / Readings / Activities Assignments
(1) 1/15 Overview and organization  
(2) 1/20 MLK DAY  
  1/22 Case study #1: The history of "losers"
(3) 1/27 Testing Krugman's theory; Case 1 Readings
  1/29 Bayes' Rule
 
(4) 2/3 A simple approach to binary classification Annotation #1
  2/5 Case study #2: Classifying News Stories  
(5) 2/10 Document similarity: TF/IDF  
  2/12 Better "vector space" models: LSA and Eigenwords  
(6) 2/17    
  2/19   Project 1
(7) 2/24    
  2/26    
(8) 3/3    
  3/5   Text Analytics Survey
  3/10 SPRING BREAK  
  3/12 SPRING BREAK  
(9) 3/17    
  3/19    
(10) 3/24    
  3/26 Shared datasets and "Common Task" programs; "Fred Jelinek", 2010
(11) 3/31 Claude Shannon, "A Mathematical Theory of Communication", 1948; "Prediction and Entropy of Printed English", 1950
  4/2 David Lazer et al., "The Parable of Google Flu", Science 3/14/2014 (Podcast interview) (Google Flu Trends)
(12) 4/7 Gary Marcus & Ernest Davis, "Eight (No, Nine!) Problems with Big Data", NYT 4/6/2014
  4/9    
(13) 4/14   Project 2
  4/16    
(14) 4/21    
  4/23    
(15) 4/28    
  4/30    
"log like