Main menu:


Add to Google

Subscribe via e-mail:


Site search

Categories

January 2007
M T W T F S S
    Feb »
1234567
891011121314
15161718192021
22232425262728
293031  

Archive

Archive for January 27th, 2007

The paper that would not die

Sources of Success for Boosted Wrapper Induction Journal of Machine Learning Research, Volume 5 Written October 2001, published December 2004 Download PDF (29 pages) Download PPT (900KB; presentation at Stanford’s Seminar for Computational Learning and Adaptation) I co-wrote this paper during the first summer I started doing NLP research, but it didn’t see the light [...]

A nifty NLP paper that never made it

Conditional Estimation of HMMs for Information Extraction Submitted to ACL 2003 Sapporo, Japan July 2003 Download PDF (8 pages) Download PPT (500KB; presentation to NLP group, including work discussed in this paper) This is another paper I wrote that didn’t get accepted for publication. Like my character-level paper, it was interesting and useful but not well [...]

Information Extraction for the Semantic Web

Finding Educational Resources on the Web: Exploiting Automatic Extraction of Metadata Workshop on Adaptive Text Extraction and Mining Cavtat-Dubrovnik, Croatia Sempetmber 22, 2003 Download PDF (4 pages) The Semantic Web is a great idea: expose all of the information on the web in a machine-readable format, and intelligent agents will the be able to read [...]

My most famous NLP paper (CoNLL-03)

Named Entity Recognition with Character-Level Models HLT-NAACL CoNLL-03 Shared Task Edmonton, Canada June 1, 2003 Download PDF (4 pages) Download PPT (3.8MB; presentation at CoNLL-03) Every year that Conference on Computational Natural Language Learning (CoNLL) has a “shared task” where they define a specific problem to solve, provide a standard data set to train your [...]

My first NLP research paper

Classifying Unknown Proper Noun Phrases Without Context Technical Report dbpubs/2002-46 Stanford University April 9, 2002 Download PDF (9 pages) Download PPT (1.3MB; presentation of the paper to the NLP group) As I describe in my post about my master’s thesis, I started doing research in Natural Language Processing after Chris Manning, the professor that taught [...]

My Stanford Master’s Thesis

Categorization by Character-Level Models: Exploiting the Sound Symbolism of Proper Names Master’s thesis, Symbolic Systems Program, Stanford University Christopher D. Manning, Advisor June 11, 2003 Download PDF (52 pages)  After four years as an undergraduate at Stanford, I wasn’t ready to leave yet. There were more classes I wanted to take, and I wanted to [...]

FOAF Workshop Talk

Technical and Privacy Challenges for Integrating FOAF into Existing Applications FOAF Workshop Galway, Ireland September 2, 2004 Full paper (HTML) Download PPT (2.1MB) FOAF stands for friend-of-a-friend and it’s an open standard for describing your contact information and who you know. When social networking sites started exploding, many people were annoyed that they had to [...]

Mashup University Talk

Smarten Up Your Address Book with a Plaxo Mashup Mashup University (part of Mashup Camp 2) Mountain View, CA (Computer History Museum) July 12, 2006 Download PPT (1.7MB) Watch the complete video of my talk (QT, 78MB) Plaxo sponsored me to give a talk at the beginning of MashupCamp2 (alongside speakers from Microsoft, AOL, and [...]

Cross-Site Ajax (OSCON 2006)

Cross-Site Ajax: Challenges and Techniques for Building Rich Web 2.0 Mashups O’Reilly Open Source Convention (OSCON) 2006 Portland, OR July 26, 2006 Download PPT (1.8MB) This was the first OSCON I ever attended. I had a great time and I met a lot of amazing people. I’m definitely going back next year. Much of what [...]