<?xml version='1.0' encoding='UTF-8'?><?xml-stylesheet href="http://www.blogger.com/styles/atom.css" type="text/css"?><feed xmlns='http://www.w3.org/2005/Atom' xmlns:openSearch='http://a9.com/-/spec/opensearchrss/1.0/' xmlns:georss='http://www.georss.org/georss' xmlns:gd='http://schemas.google.com/g/2005' xmlns:thr='http://purl.org/syndication/thread/1.0'><id>tag:blogger.com,1999:blog-8665317644900962850</id><updated>2011-10-25T13:55:48.543-07:00</updated><category term='sentiment classification'/><category term='parsing'/><category term='learning theory'/><category term='map reduce'/><category term='domain adaptation'/><category term='grid'/><category term='natural language processing'/><title type='text'>Structured Learning</title><subtitle type='html'></subtitle><link rel='http://schemas.google.com/g/2005#feed' type='application/atom+xml' href='http://structlearn.blogspot.com/feeds/posts/default'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8665317644900962850/posts/default?max-results=100'/><link rel='alternate' type='text/html' href='http://structlearn.blogspot.com/'/><link rel='hub' href='http://pubsubhubbub.appspot.com/'/><author><name>Fernando Pereira</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh4.googleusercontent.com/-m0RNMya3Dlw/AAAAAAAAAAI/AAAAAAAAAAA/LZqK1bhE0Vg/s512-c/photo.jpg'/></author><generator version='7.00' uri='http://www.blogger.com'>Blogger</generator><openSearch:totalResults>8</openSearch:totalResults><openSearch:startIndex>1</openSearch:startIndex><openSearch:itemsPerPage>100</openSearch:itemsPerPage><entry><id>tag:blogger.com,1999:blog-8665317644900962850.post-6545072666997256925</id><published>2007-07-03T08:20:00.001-07:00</published><updated>2007-07-03T08:20:49.656-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='natural language processing'/><title type='text'>Corrections to ACL Anthology URLs</title><content type='html'>&lt;p&gt;Thanks to an alert reader, I found out that several of the paper links in my &lt;a href="http://structlearn.blogspot.com/2007/07/interesting-papers-at-acl-and-emnlp.html"&gt;previous&lt;/a&gt; &lt;a href="http://structlearn.blogspot.com/2007/07/reposting-interesting-acl-and-conll.html"&gt;postings&lt;/a&gt; on ACL and EMNLP-CoNLL papers where incorrect. The problem was that some of the BibTeX entries in the ACL DVD distributed in Prague have wrong &lt;a href="http://acl.ldc.upenn.edu/"&gt;ACL Anthology&lt;/a&gt; links, and I derived these postings semi-authomatically from those BibTeX entries. I've edited the most recent posting to use the correct links.&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8665317644900962850-6545072666997256925?l=structlearn.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://structlearn.blogspot.com/feeds/6545072666997256925/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8665317644900962850&amp;postID=6545072666997256925' title='2 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8665317644900962850/posts/default/6545072666997256925'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8665317644900962850/posts/default/6545072666997256925'/><link rel='alternate' type='text/html' href='http://structlearn.blogspot.com/2007/07/corrections-to-acl-anthology-urls.html' title='Corrections to ACL Anthology URLs'/><author><name>Fernando Pereira</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh4.googleusercontent.com/-m0RNMya3Dlw/AAAAAAAAAAI/AAAAAAAAAAA/LZqK1bhE0Vg/s512-c/photo.jpg'/></author><thr:total>2</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8665317644900962850.post-2486496269703765111</id><published>2007-07-02T18:10:00.001-07:00</published><updated>2007-07-04T10:58:52.817-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='natural language processing'/><title type='text'>Reposting interesting ACL and CoNLL-EMNLP papers</title><content type='html'>&lt;p&gt;I've now added a short comment to each paper. This list is created semi-automatically from &lt;a href="http://bibdesk.sourceforge.net/"&gt;BibDesk&lt;/a&gt; with a custom HTML export template and some minor post-editing. The red titles are my special picks.&lt;/p&gt;&lt;STYLE type="text/css"&gt;&lt;!--.content {text-align: justify;	float: right;	width: 95%; }.Title { font-weight: bolder;	font-size: 1.0em;	color: #225522; }.HighTitle { font-weight: bolder;	font-size: 1.0em;	color: red; }.Author { color: black; }.Journal, .Conference { font-style: italic; }.Date { color: black; }.Url { font-size: 0.8em; color: #c95e62; }.Abstract, .Annote {	font-size: 0.9em;	line-height: 100%; 	color: dark-gray;	padding-top: 10px; }--&gt;&lt;/STYLE&gt;&lt;ol&gt;&lt;li class="Pub"&gt;	&lt;span class="Title"&gt;Frustratingly Easy Domain Adaptation&lt;/span&gt;&lt;br /&gt;	&lt;span class="Author"&gt;H. Daume III&lt;/span&gt;&lt;br /&gt;	&lt;span class="Conference"&gt;Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics&lt;/span&gt;&amp;nbsp;	&lt;span class="Pages"&gt;256--263&lt;/span&gt;&amp;nbsp;	(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;	&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/P/P07/P07-1033"&gt;http://www.aclweb.org/anthology/P/P07/P07-1033&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;        &lt;span class="Annote"&gt;Assumes both source and target labeled data. Instance features are replicated as "feature from source" and "fature from target". Results are surprisingly good for such a simple method. Why? It is easy to create a counterexample in which this does not work, so it would be important to characterize precisely when it works.&lt;/span&gt;&lt;/li&gt;&lt;li class="Pub"&gt;	&lt;span class="Title"&gt;A Bayesian Model for Discovering Typological Implications&lt;/span&gt;&lt;br /&gt;	&lt;span class="Author"&gt;H. Daume III and L. Campbell&lt;/span&gt;&lt;br /&gt;	&lt;span class="Conference"&gt;Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics&lt;/span&gt;&amp;nbsp;	&lt;span class="Pages"&gt;65--72&lt;/span&gt;&amp;nbsp;	(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;	&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/P/P07/P07-1009"&gt;http://www.aclweb.org/anthology/P/P07/P07-1009&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;        &lt;span class="Annote"&gt;Induces relationships between typological features of languages from very sparse descriptive data. Finds relationships discussed in the comparative literature as well as some others that deserve investigation.&lt;/span&gt;&lt;/li&gt;&lt;li class="Pub"&gt;	&lt;span class="Title"&gt;Sparse Information Extraction: Unsupervised Language Models to the Rescue&lt;/span&gt;&lt;br /&gt;	&lt;span class="Author"&gt;D. Downey, S. Schoenmackers, and O. Etzioni&lt;/span&gt;&lt;br /&gt;	&lt;span class="Conference"&gt;Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics&lt;/span&gt;&amp;nbsp;	&lt;span class="Pages"&gt;696--703&lt;/span&gt;&amp;nbsp;	(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;	&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/P/P07/P07-1088"&gt;http://www.aclweb.org/anthology/P/P07/P07-1088&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;        &lt;span class="Annote"&gt;The main problem with previous work on unsupervised extraction based on finding many instances of a putative entity or relationship is that it has low recall. To address this, this paper creates HMM models from the contexts of common extractions and uses them to measure the plausibility of rare candidate extractions. Simple idea with good results.&lt;/span&gt;&lt;/li&gt;&lt;li class="Pub"&gt;	&lt;span class="Title"&gt;A Comparative Study of Parameter Estimation Methods for Statistical Natural Language Processing&lt;/span&gt;&lt;br /&gt;	&lt;span class="Author"&gt;J. Gao, G. Andrew, M. Johnson, and K. Toutanova&lt;/span&gt;&lt;br /&gt;	&lt;span class="Conference"&gt;Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics&lt;/span&gt;&amp;nbsp;	&lt;span class="Pages"&gt;824--831&lt;/span&gt;&amp;nbsp;	(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;	&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/P/P07/P07-1104"&gt;http://www.aclweb.org/anthology/P/P07/P07-1104&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;        &lt;span class="Annote"&gt;Bottom line: L_1 regularization of logistic regression does not hurt generalization, and makes the models much smaller. Nice to have a careful study that documents the benefits and limitations of L_1 regularization in a range of common text classification tasks.&lt;/span&gt;&lt;/li&gt;&lt;li class="Pub"&gt;	&lt;span class="HighTitle"&gt;Unsupervised Coreference Resolution in a Nonparametric Bayesian Model&lt;/span&gt;&lt;br /&gt;	&lt;span class="Author"&gt;A. Haghighi and D. Klein&lt;/span&gt;&lt;br /&gt;	&lt;span class="Conference"&gt;Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics&lt;/span&gt;&amp;nbsp;	&lt;span class="Pages"&gt;848--855&lt;/span&gt;&amp;nbsp;	(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;	&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/P/P07/P07-1107"&gt;http://www.aclweb.org/anthology/P/P07/P07-1107&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;        &lt;span class="Annote"&gt;A very nice result beautifully presented. The "magic" of Dirichlet processes yields an unsupervised generative model of coreference that competes with supervised methods and can naturally incorporate a discourse model.&lt;/span&gt;&lt;/li&gt;&lt;li class="Pub"&gt;	&lt;span class="Title"&gt;K-best Spanning Tree Parsing&lt;/span&gt;&lt;br /&gt;	&lt;span class="Author"&gt;K. Hall&lt;/span&gt;&lt;br /&gt;	&lt;span class="Conference"&gt;Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics&lt;/span&gt;&amp;nbsp;	&lt;span class="Pages"&gt;392--399&lt;/span&gt;&amp;nbsp;	(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;	&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/P/P07/P07-1050"&gt;http://www.aclweb.org/anthology/P/P07/P07-1050&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;        &lt;span class="Annote"&gt;Digging into old directed spanning tree literature continues to bear fruit for dependency parsing, this time a k-best algorithm that can be used for reranking with global features. I have my reservations about reranking, but this is a good addition to the dependency parsing toolbox.&lt;/span&gt;&lt;/li&gt;&lt;li class="Pub"&gt;	&lt;span class="Title"&gt;Forest Rescoring: Faster Decoding with Integrated Language Models&lt;/span&gt;&lt;br /&gt;	&lt;span class="Author"&gt;L. Huang and D. Chiang&lt;/span&gt;&lt;br /&gt;	&lt;span class="Conference"&gt;Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics&lt;/span&gt;&amp;nbsp;	&lt;span class="Pages"&gt;144--151&lt;/span&gt;&amp;nbsp;	(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;	&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/P/P07/P07-1019"&gt;http://www.aclweb.org/anthology/P/P07/P07-1019&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;        &lt;span class="Annote"&gt;I like this aproach much better than reranking: evaluate global features as soon as possible and add their score with the local feature score in a dynamic programming parser or decoder, to produce efficiently an approximate set of k-best partial hypotheses. No such method for spanning tree dependency parsers, though... Liang gave a very clear talk.&lt;/span&gt;&lt;/li&gt;&lt;li class="Pub"&gt;	&lt;span class="Title"&gt;Exploiting Wikipedia as External Knowledge for Named Entity Recognition&lt;/span&gt;&lt;br /&gt;	&lt;span class="Author"&gt;J. Kazama and K. Torisawa&lt;/span&gt;&lt;br /&gt;	&lt;span class="Conference"&gt;Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL)&lt;/span&gt;&amp;nbsp;	&lt;span class="Pages"&gt;698--707&lt;/span&gt;&amp;nbsp;	(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;	&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/D/D07/D07-1073"&gt;http://www.aclweb.org/anthology/D/D07/D07-1073&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;        &lt;span class="Annote"&gt;The basic idea is simple and effective. For each entity described in Wikipedia, find a defining sentence, extract from it heuristically a noun that is likely to be the entity's category, and add that as a "label" feature to other features in a CRF extractor. The details are a bit complicated, but the accuracy improvements make it very worthwhile.&lt;/span&gt;&lt;/li&gt;&lt;li class="Pub"&gt;	&lt;span class="HighTitle"&gt;Structured Prediction Models via the Matrix-Tree Theorem&lt;/span&gt;&lt;br /&gt;	&lt;span class="Author"&gt;T. Koo, A. Globerson, X. Carreras, and M. Collins&lt;/span&gt;&lt;br /&gt;	&lt;span class="Conference"&gt;Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL)&lt;/span&gt;&amp;nbsp;	&lt;span class="Pages"&gt;141--150&lt;/span&gt;&amp;nbsp;	(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;	&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/D/D07/D07-1015"&gt;http://www.aclweb.org/anthology/D/D07/D07-1015&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;        &lt;span class="Annote"&gt;Several groups discovered concurrently that Tutte's matrix-tree theorem would yield an efficient computation of the normalization for log-linear models of non-projective dependencies. There were three papers on different aspects of this in Prague, one at IWPT which I didn't see, and two at EMNLP. I selected this one because it shows how to cast several learning methods (log-linear and max-margin) into a common framework with very good results. The talk by Terry Koo was clear and convincing.&lt;/span&gt;&lt;/li&gt;&lt;li class="Pub"&gt;	&lt;span class="Title"&gt;Mildly Context-Sensitive Dependency Languages&lt;/span&gt;&lt;br /&gt;	&lt;span class="Author"&gt;M. Kuhlmann and M. M\"ohl&lt;/span&gt;&lt;br /&gt;	&lt;span class="Conference"&gt;Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics&lt;/span&gt;&amp;nbsp;	&lt;span class="Pages"&gt;160--167&lt;/span&gt;&amp;nbsp;	(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;	&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/P/P07/P07-1021"&gt;http://www.aclweb.org/anthology/P/P07/P07-1021&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;        &lt;span class="Annote"&gt;The complexity of dependency grammar parsing is related to formal measured os their degree of nonprojectivlty following an approach first introduced for mildly context-sensitive grammars. Dependency grammar is a formal island no longer.&lt;/span&gt;&lt;/li&gt;&lt;li class="Pub"&gt;	&lt;span class="Title"&gt;The Infinite PCFG Using Hierarchical Dirichlet Processes&lt;/span&gt;&lt;br /&gt;	&lt;span class="Author"&gt;P. Liang, S. Petrov, M. Jordan, and D. Klein&lt;/span&gt;&lt;br /&gt;	&lt;span class="Conference"&gt;Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL)&lt;/span&gt;&amp;nbsp;	&lt;span class="Pages"&gt;688--697&lt;/span&gt;&amp;nbsp;	(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;	&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/D/D07/D07-1072"&gt;http://www.aclweb.org/anthology/D/D07/D07-1072&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;        &lt;span class="Annote"&gt;Another idea that was in the air got two papers in Prague: hierarchical Dirichlet processes for unsupervised PCFG induction. I liked this paper better, as Percy Liang gave a beautifully clear exposition of a method that was pretty opaque in most previous presentations of related work. It must have helped that Percy and Dan Klein had given a tutorial on Bayesian nonparametric models a few days before.&lt;/span&gt;&lt;/li&gt;&lt;li class="Pub"&gt;	&lt;span class="Title"&gt;Characterizing the Errors of Data-Driven Dependency Parsing Models&lt;/span&gt;&lt;br /&gt;	&lt;span class="Author"&gt;R. McDonald and J. Nivre&lt;/span&gt;&lt;br /&gt;	&lt;span class="Conference"&gt;Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL)&lt;/span&gt;&amp;nbsp;	&lt;span class="Pages"&gt;122--131&lt;/span&gt;&amp;nbsp;	(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;	&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/D/D07/D07-1013"&gt;http://www.aclweb.org/anthology/D/D07/D07-1013&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;        &lt;span class="Annote"&gt;It was rather intriguing at last year's CoNLL evaluation of multilingual dependency parsing that the two top parsers (our MSTParser and Nivre's MaltParser) had overall scores that were statistically indistinguishable, even though they are very different in design. This paper explains the results: MaltParser's greedy deterministic method can use more context and works best on shorter sentences, but greed hurts it on longer sentences. MSTParser uses just local features, so it suffers on shorter sentences, but optimal search makes it do better on longer sentences. How can we combine these benefits? I know, I know, parser combination in the Sagai and Lavie mold can do it, but I'd prefer something more integrated.&lt;/span&gt;&lt;/li&gt;&lt;li class="Pub"&gt;	&lt;span class="Title"&gt;Structured Models for Fine-to-Coarse Sentiment Analysis&lt;/span&gt;&lt;br /&gt;	&lt;span class="Author"&gt;R. McDonald, K. Hannan, T. Neylon, M. Wells, and J. Reynar&lt;/span&gt;&lt;br /&gt;	&lt;span class="Conference"&gt;Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics&lt;/span&gt;&amp;nbsp;	&lt;span class="Pages"&gt;432--439&lt;/span&gt;&amp;nbsp;	(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;	&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/P/P07/P07-1055"&gt;http://www.aclweb.org/anthology/P/P07/P07-1055&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;        &lt;span class="Annote"&gt;Combines document-level and sentence-level sentiment classification into a simple, easy to train structured model. Outperforms previous methods significantly at the sentence level, and does competitively at the document level.&lt;/span&gt;&lt;/li&gt;&lt;li class="Pub"&gt;	&lt;span class="HighTitle"&gt;Learning Structured Models for Phone Recognition&lt;/span&gt;&lt;br /&gt;	&lt;span class="Author"&gt;S. Petrov, A. Pauls, and D. Klein&lt;/span&gt;&lt;br /&gt;	&lt;span class="Conference"&gt;Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL)&lt;/span&gt;&amp;nbsp;	&lt;span class="Pages"&gt;897--905&lt;/span&gt;&amp;nbsp;	(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;	&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/D/D07/D07-1094"&gt;http://www.aclweb.org/anthology/D/D07/D07-1094&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;        &lt;span class="Annote"&gt;Learn the structure of phone models from the data, rather than postulating a fixed structure in advance. Exploit it to represent context dependency concisely. Great paper, excellently presented. I tried to convince some speech colleagues that this could be done over ten years ago, but they were skeptical. It was probably too early, and this paper does it with way better methods than I had then. &lt;/span&gt;&lt;/li&gt;&lt;li class="Pub"&gt;	&lt;span class="Title"&gt;Guided Learning for Bidirectional Sequence Classification&lt;/span&gt;&lt;br /&gt;	&lt;span class="Author"&gt;L. Shen, G. Satta, and A. Joshi&lt;/span&gt;&lt;br /&gt;	&lt;span class="Conference"&gt;Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics&lt;/span&gt;&amp;nbsp;	&lt;span class="Pages"&gt;760--767&lt;/span&gt;&amp;nbsp;	(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;	&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/P/P07/P07-1096"&gt;http://www.aclweb.org/anthology/P/P07/P07-1096&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;        &lt;span class="Annote"&gt;Learn a linear sequence model for a problem where exhaustive search is not possible by starting from high-confidence labels and learning which actions to apply to extend the high-confidence regions to a full labeling of the sequence. Best Penn Treebank POS tagging results ever, and the method applies easily to other tagging and parsing problems. Who needs reranking now?&lt;/span&gt;&lt;/li&gt;&lt;li class="Pub"&gt;	&lt;span class="Title"&gt;Semi-Supervised Structured Output Learning Based on a Hybrid Generative and Discriminative Approach&lt;/span&gt;&lt;br /&gt;	&lt;span class="Author"&gt;J. Suzuki, A. Fujino, and H. Isozaki&lt;/span&gt;&lt;br /&gt;	&lt;span class="Conference"&gt;Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL)&lt;/span&gt;&amp;nbsp;	&lt;span class="Pages"&gt;791--800&lt;/span&gt;&amp;nbsp;	(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;	&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/D/D07/D07-1083"&gt;http://www.aclweb.org/anthology/D/D07/D07-1083&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;        &lt;span class="Annote"&gt;I don't understand this paper fully yet -- the notation and presentation are pretty dense -- but the idea of learning together a CRF from labeled data and HMMs for the same state space from unlabeled data is an intriguing approach to semi-supervised CRF training. One of my post-conference homeworks is to figure our how this does (or not) relate with ASO. Lots of other possible connections, such as Pal and McCallum's multiconditional models. &lt;/span&gt;&lt;/li&gt;&lt;li class="Pub"&gt;	&lt;span class="Title"&gt;Randomised Language Modelling for Statistical Machine Translation&lt;/span&gt;&lt;br /&gt;	&lt;span class="Author"&gt;D. Talbot and M. Osborne&lt;/span&gt;&lt;br /&gt;	&lt;span class="Conference"&gt;Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics&lt;/span&gt;&amp;nbsp;	&lt;span class="Pages"&gt;512--519&lt;/span&gt;&amp;nbsp;	(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;	&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/P/P07/P07-1065"&gt;http://www.aclweb.org/anthology/P/P07/P07-1065&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;        &lt;span class="Annote"&gt;I can't quite evaluate this, since I've not been working on large language models recently, but there's something deliciously preverse about using randomized hashing to throw away n-gram data from a big model that doesn't really matter in practice.&lt;/span&gt;&lt;/li&gt;&lt;li class="Pub"&gt;	&lt;span class="HighTitle"&gt;Online Learning of Relaxed CCG Grammars for Parsing to Logical Form&lt;/span&gt;&lt;br /&gt;	&lt;span class="Author"&gt;L. Zettlemoyer and M. Collins&lt;/span&gt;&lt;br /&gt;	&lt;span class="Conference"&gt;Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL)&lt;/span&gt;&amp;nbsp;	&lt;span class="Pages"&gt;678--687&lt;/span&gt;&amp;nbsp;	(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;	&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/D/D07/D07-1071"&gt;http://www.aclweb.org/anthology/D/D07/D07-1071&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;        &lt;span class="Annote"&gt;Given a training set of sentence-meaning pairs, a CCG-based lexicon induction process discovers potential word meanings and category assignments, and a ranking of alternatives, so that the given training set is correctly analyzed and interpreted. I love this connection between online learning, categorial grammars, and logical semantics, and I think there's a rich vein to explore here.&lt;/span&gt;&lt;/li&gt;&lt;/ol&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8665317644900962850-2486496269703765111?l=structlearn.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://structlearn.blogspot.com/feeds/2486496269703765111/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8665317644900962850&amp;postID=2486496269703765111' title='1 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8665317644900962850/posts/default/2486496269703765111'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8665317644900962850/posts/default/2486496269703765111'/><link rel='alternate' type='text/html' href='http://structlearn.blogspot.com/2007/07/reposting-interesting-acl-and-conll.html' title='Reposting interesting ACL and CoNLL-EMNLP papers'/><author><name>Fernando Pereira</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh4.googleusercontent.com/-m0RNMya3Dlw/AAAAAAAAAAI/AAAAAAAAAAA/LZqK1bhE0Vg/s512-c/photo.jpg'/></author><thr:total>1</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8665317644900962850.post-5647122715721569588</id><published>2007-07-01T19:01:00.001-07:00</published><updated>2007-07-01T19:39:44.824-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='natural language processing'/><title type='text'>Interesting papers at ACL and EMNLP-CoNLL</title><content type='html'>&lt;p&gt;I just got back from ACL and EMNLP-CoNLL in Prague. There were many interesting papers, more than I could attend because of session conflices. Here are some that I found especially worthwhile. Those highlighted in red really stood out. I'll add comments on some of these papers later, but I don't have time now.&lt;/p&gt;&lt;STYLE type="text/css"&gt;&lt;STYLE type="text/css"&gt;&lt;!--.content { background-color: #fffeec;text-align: justify; float: right;	width: 95%;	padding: 10px 1%;	margin: 10px 1%;   border-style: solid;border-width: 1px;border-color: #ead388; }.Title { font-weight: bolder;	font-size: 1.0em; color: #225522; }.HighTitle { font-weight: bolder; font-size: 1.0em; color: red; }.Author { color: black; }.Journal, .Conference { font-style: italic; }.Volume { font-weight: bolder; }.Date { color: black; }.Url { font-size: 0.8em; color: #c95e62; }--&gt;&lt;/STYLE&gt;&lt;ul&gt;&lt;li class="Pub"&gt;&lt;span class="HighTitle"&gt;Unsupervised Coreference Resolution in a Nonparametric Bayesian Model&lt;/span&gt;&lt;br /&gt;&lt;span class="Author"&gt;A. Haghighi and D. Klein&lt;/span&gt;&lt;br /&gt;&lt;span class="Conference"&gt;45th ACL&lt;/span&gt;&amp;nbsp;&lt;span class="Pages"&gt;848--855&lt;/span&gt;&amp;nbsp;(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/P/P07/P07-0107"&gt;http://www.aclweb.org/anthology/P/P07/P07-0107&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;&lt;/li&gt;&lt;br /&gt;&lt;li class="Pub"&gt;&lt;span class="HighTitle"&gt;Structured Prediction Models via the Matrix-Tree Theorem&lt;/span&gt;&lt;br /&gt;&lt;span class="Author"&gt;T. Koo, A. Globerson, X. Carreras, and M. Collins&lt;/span&gt;&lt;br /&gt;&lt;span class="Conference"&gt; EMNLP-CoNLL&lt;/span&gt;&amp;nbsp;	&lt;span class="Pages"&gt;141--150&lt;/span&gt;&amp;nbsp;(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/D/D07/D07-1015"&gt;http://www.aclweb.org/anthology/D/D07/D07-1015&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;&lt;/li&gt;&lt;br /&gt;&lt;li class="Pub"&gt;	&lt;span class="HighTitle"&gt;Learning Structured Models for Phone Recognition&lt;/span&gt;&lt;br /&gt;&lt;span class="Author"&gt;S. Petrov, A. Pauls, and D. Klein&lt;/span&gt;&lt;br /&gt;	&lt;span class="Conference"&gt; EMNLP-CoNLL&lt;/span&gt;&amp;nbsp;&lt;span class="Pages"&gt;897--905&lt;/span&gt;&amp;nbsp;(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;	&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/D/D07/D07-1094"&gt;http://www.aclweb.org/anthology/D/D07/D07-1094&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;&lt;/li&gt;&lt;br /&gt;&lt;li class="Pub"&gt;&lt;span class="HighTitle"&gt;Online Learning of Relaxed CCG Grammars for Parsing to Logical Form&lt;/span&gt;&lt;br /&gt;	&lt;span class="Author"&gt;L. Zettlemoyer and M. Collins&lt;/span&gt;&lt;br /&gt;&lt;span class="Conference"&gt; EMNLP-CoNLL&lt;/span&gt;&amp;nbsp;&lt;span class="Pages"&gt;678--687&lt;/span&gt;&amp;nbsp;	(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/D/D07/D07-1071"&gt;http://www.aclweb.org/anthology/D/D07/D07-1071&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;&lt;/li&gt;&lt;br /&gt;&lt;li class="Pub"&gt;&lt;span class="Title"&gt;Exploiting Wikipedia as External Knowledge for Named Entity Recognition&lt;/span&gt;&lt;br /&gt;&lt;span class="Author"&gt;J. Kazama and K. Torisawa&lt;/span&gt;&lt;br /&gt;&lt;span class="Conference"&gt; EMNLP-CoNLL&lt;/span&gt;&amp;nbsp;	&lt;span class="Pages"&gt;698--707&lt;/span&gt;&amp;nbsp;(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/D/D07/D07-1073"&gt;http://www.aclweb.org/anthology/D/D07/D07-1073&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;&lt;/li&gt;&lt;br /&gt;&lt;li class="Pub"&gt;	&lt;span class="Title"&gt;Semi-Supervised Structured Output Learning Based on a Hybrid Generative and Discriminative Approach&lt;/span&gt;&lt;br /&gt;&lt;span class="Author"&gt;J. Suzuki, A. Fujino, and H. Isozaki&lt;/span&gt;&lt;br /&gt;&lt;span class="Conference"&gt; EMNLP-CoNLL&lt;/span&gt;&amp;nbsp;&lt;span class="Pages"&gt;791--800&lt;/span&gt;&amp;nbsp;	(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/D/D07/D07-1083"&gt;http://www.aclweb.org/anthology/D/D07/D07-1083&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;&lt;/li&gt;&lt;br /&gt;&lt;li class="Pub"&gt;&lt;span class="Title"&gt;The Infinite PCFG Using Hierarchical Dirichlet Processes&lt;/span&gt;&lt;br /&gt;	&lt;span class="Author"&gt;P. Liang, S. Petrov, M. Jordan, and D. Klein&lt;/span&gt;&lt;br /&gt;&lt;span class="Conference"&gt; EMNLP-CoNLL&lt;/span&gt;&amp;nbsp;	&lt;span class="Pages"&gt;688--697&lt;/span&gt;&amp;nbsp;(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/D/D07/D07-1072"&gt;http://www.aclweb.org/anthology/D/D07/D07-1072&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;&lt;/li&gt;&lt;br /&gt;&lt;li class="Pub"&gt;	&lt;span class="Title"&gt;Characterizing the Errors of Data-Driven Dependency Parsing Models&lt;/span&gt;&lt;br /&gt;&lt;span class="Author"&gt;R. McDonald and J. Nivre&lt;/span&gt;&lt;br /&gt;	&lt;span class="Conference"&gt;EMNLP-CoNLL&lt;/span&gt;&amp;nbsp;&lt;span class="Pages"&gt;122--131&lt;/span&gt;&amp;nbsp;(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;	&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/D/D07/D07-1013"&gt;http://www.aclweb.org/anthology/D/D07/D07-1013&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;&lt;/li&gt;&lt;br /&gt;&lt;li class="Pub"&gt;&lt;span class="Title"&gt;A Comparative Study of Parameter Estimation Methods for Statistical Natural Language Processing&lt;/span&gt;&lt;br /&gt;	&lt;span class="Author"&gt;J. Gao, G. Andrew, M. Johnson, and K. Toutanova&lt;/span&gt;&lt;br /&gt;&lt;span class="Conference"&gt;45th ACL&lt;/span&gt;&amp;nbsp;	&lt;span class="Pages"&gt;824--831&lt;/span&gt;&amp;nbsp;(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/P/P07/P07-0104"&gt;http://www.aclweb.org/anthology/P/P07/P07-0104&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;&lt;/li&gt;&lt;br /&gt;&lt;li class="Pub"&gt;	&lt;span class="Title"&gt;Guided Learning for Bidirectional Sequence Classification&lt;/span&gt;&lt;br /&gt;&lt;span class="Author"&gt;L. Shen, G. Satta, and A. Joshi&lt;/span&gt;&lt;br /&gt;	&lt;span class="Conference"&gt;45th ACL&lt;/span&gt;&amp;nbsp;&lt;span class="Pages"&gt;760--767&lt;/span&gt;&amp;nbsp;(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;	&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/P/P07/P07-0096"&gt;http://www.aclweb.org/anthology/P/P07/P07-0096&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;&lt;/li&gt;&lt;br /&gt;&lt;li class="Pub"&gt;	&lt;span class="Title"&gt;Sparse Information Extraction: Unsupervised Language Models to the Rescue&lt;/span&gt;&lt;br /&gt;	&lt;span class="Author"&gt;D. Downey, S. Schoenmackers, and O. Etzioni&lt;/span&gt;&lt;br /&gt;&lt;span class="Conference"&gt;45th ACL&lt;/span&gt;&amp;nbsp;&lt;span class="Pages"&gt;696--703&lt;/span&gt;&amp;nbsp;	(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/P/P07/P07-0088"&gt;http://www.aclweb.org/anthology/P/P07/P07-0088&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;&lt;/li&gt;&lt;br /&gt;&lt;li class="Pub"&gt;&lt;span class="Title"&gt;Randomised Language Modelling for Statistical Machine Translation&lt;/span&gt;&lt;br /&gt;&lt;span class="Author"&gt;D. Talbot and M. Osborne&lt;/span&gt;&lt;br /&gt;&lt;span class="Conference"&gt;45th ACL&lt;/span&gt;&amp;nbsp;	&lt;span class="Pages"&gt;512--519&lt;/span&gt;&amp;nbsp;(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/P/P07/P07-0065"&gt;http://www.aclweb.org/anthology/P/P07/P07-0065&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;&lt;/li&gt;&lt;br /&gt;&lt;li class="Pub"&gt;	&lt;span class="Title"&gt;Structured Models for Fine-to-Coarse Sentiment Analysis&lt;/span&gt;&lt;br /&gt;&lt;span class="Author"&gt;R. McDonald, K. Hannan, T. Neylon, M. Wells, and J. Reynar&lt;/span&gt;&lt;br /&gt;	&lt;span class="Conference"&gt;45th ACL&lt;/span&gt;&amp;nbsp;&lt;span class="Pages"&gt;432--439&lt;/span&gt;&amp;nbsp;(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;	&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/P/P07/P07-1055"&gt;http://www.aclweb.org/anthology/P/P07/P07-1055&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;&lt;/li&gt;&lt;br /&gt;&lt;li class="Pub"&gt;&lt;span class="Title"&gt;K-best Spanning Tree Parsing&lt;/span&gt;&lt;br /&gt;&lt;span class="Author"&gt;K. Hall&lt;/span&gt;&lt;br /&gt;	&lt;span class="Conference"&gt;45th ACL&lt;/span&gt;&amp;nbsp;&lt;span class="Pages"&gt;392--399&lt;/span&gt;&amp;nbsp;(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;	&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/P/P07/P07-1050"&gt;http://www.aclweb.org/anthology/P/P07/P07-1050&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;&lt;/li&gt;&lt;br /&gt;&lt;li class="Pub"&gt;&lt;span class="Title"&gt;Frustratingly Easy Domain Adaptation&lt;/span&gt;&lt;br /&gt;&lt;span class="Author"&gt;H. Daume III&lt;/span&gt;&lt;br /&gt;	&lt;span class="Conference"&gt;45th ACL&lt;/span&gt;&amp;nbsp;&lt;span class="Pages"&gt;256--263&lt;/span&gt;&amp;nbsp;(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;	&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/P/P07/P07-1033"&gt;http://www.aclweb.org/anthology/P/P07/P07-1033&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;&lt;/li&gt;&lt;br /&gt;&lt;li class="Pub"&gt;&lt;span class="Title"&gt;A Bayesian Model for Discovering Typological Implications&lt;/span&gt;&lt;br /&gt;&lt;span class="Author"&gt;H. Daume III and L. Campbell&lt;/span&gt;&lt;br /&gt;&lt;span class="Conference"&gt;45th ACL&lt;/span&gt;&amp;nbsp;&lt;span class="Pages"&gt;65--72&lt;/span&gt;&amp;nbsp;(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;	&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/P/P07/P07-1009"&gt;http://www.aclweb.org/anthology/P/P07/P07-1009&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;&lt;/li&gt;&lt;br /&gt;&lt;li class="Pub"&gt;&lt;span class="Title"&gt;Forest Rescoring: Faster Decoding with Integrated Language Models&lt;/span&gt;&lt;br /&gt;	&lt;span class="Author"&gt;L. Huang and D. Chiang&lt;/span&gt;&lt;br /&gt;&lt;span class="Conference"&gt;45th ACL&lt;/span&gt;&amp;nbsp;	&lt;span class="Pages"&gt;144--151&lt;/span&gt;&amp;nbsp;	(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/P/P07/P07-1019"&gt;http://www.aclweb.org/anthology/P/P07/P07-1019&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;&lt;/li&gt;&lt;br /&gt;&lt;li class="Pub"&gt;&lt;span class="Title"&gt;Mildly Context-Sensitive Dependency Languages&lt;/span&gt;&lt;br /&gt;	&lt;span class="Author"&gt;M. Kuhlmann and M. M\"ohl&lt;/span&gt;&lt;br /&gt;&lt;span class="Conference"&gt;45th ACL&lt;/span&gt;&amp;nbsp;	&lt;span class="Pages"&gt;160--167&lt;/span&gt;&amp;nbsp;(&lt;span class="Date"&gt;2007&lt;/span&gt;)&lt;br /&gt;&lt;span class="Url"&gt;&lt;a href="http://www.aclweb.org/anthology/P/P07/P07-1021"&gt;http://www.aclweb.org/anthology/P/P07/P07-1021&lt;/a&gt;&lt;/span&gt;&lt;br /&gt;&lt;/li&gt;&lt;br /&gt;&lt;/ul&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8665317644900962850-5647122715721569588?l=structlearn.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://structlearn.blogspot.com/feeds/5647122715721569588/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8665317644900962850&amp;postID=5647122715721569588' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8665317644900962850/posts/default/5647122715721569588'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8665317644900962850/posts/default/5647122715721569588'/><link rel='alternate' type='text/html' href='http://structlearn.blogspot.com/2007/07/interesting-papers-at-acl-and-emnlp.html' title='Interesting papers at ACL and EMNLP-CoNLL'/><author><name>Fernando Pereira</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh4.googleusercontent.com/-m0RNMya3Dlw/AAAAAAAAAAI/AAAAAAAAAAA/LZqK1bhE0Vg/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8665317644900962850.post-6005056709440452186</id><published>2007-06-18T07:38:00.001-07:00</published><updated>2007-06-18T07:39:25.623-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='map reduce'/><category scheme='http://www.blogger.com/atom/ns#' term='natural language processing'/><category scheme='http://www.blogger.com/atom/ns#' term='grid'/><title type='text'>Data Catalysis</title><content type='html'>&lt;p&gt;Mark Liberman &lt;a href="http://itre.cis.upenn.edu/%7Emyl/languagelog/archives/004615.html"&gt;recommended&lt;/a&gt; Patrick Pantel's &lt;a href="http://www.isi.edu/%7Epantel/Download/Papers/2007/isuc07.pdf"&gt;Data Catalysis&lt;/a&gt; (Via &lt;a href="http://itre.cis.upenn.edu/~myl/languagelog/"&gt;Language Log&lt;/a&gt;.)&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8665317644900962850-6005056709440452186?l=structlearn.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://structlearn.blogspot.com/feeds/6005056709440452186/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8665317644900962850&amp;postID=6005056709440452186' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8665317644900962850/posts/default/6005056709440452186'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8665317644900962850/posts/default/6005056709440452186'/><link rel='alternate' type='text/html' href='http://structlearn.blogspot.com/2007/06/data-catalysis.html' title='Data Catalysis'/><author><name>Fernando Pereira</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh4.googleusercontent.com/-m0RNMya3Dlw/AAAAAAAAAAI/AAAAAAAAAAA/LZqK1bhE0Vg/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8665317644900962850.post-7652881742423250247</id><published>2007-06-16T12:22:00.001-07:00</published><updated>2007-06-16T12:23:39.457-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='domain adaptation'/><category scheme='http://www.blogger.com/atom/ns#' term='parsing'/><title type='text'>Frustratingly Hard Domain Adaptation for Parsing</title><content type='html'>&lt;pre&gt;@inproceedings{Dredze07Frustratingly,&lt;br /&gt;author = {Mark Dredze and John Blitzer and Pratha Pratim Talukdar and Kuzman Ganchev and Joao Graca and Fernando Pereira},&lt;br /&gt;title = {Frustratingly Hard Domain Adaptation for Parsing},&lt;br /&gt;booktitle = "Conference on Natural Language Learning",&lt;br /&gt;address = "Prague, Czech Republic"&lt;br /&gt;year = "2007"&lt;br /&gt;}&lt;/pre&gt;&lt;br /&gt;&lt;p&gt;We will be presenting &lt;a href="http://www.cis.upenn.edu/~blitzer/papers/parsing.pdf"&gt;our struggles&lt;/a&gt; with domain adaptation for parsing at CoNLL the week after next in Prague.&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8665317644900962850-7652881742423250247?l=structlearn.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://structlearn.blogspot.com/feeds/7652881742423250247/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8665317644900962850&amp;postID=7652881742423250247' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8665317644900962850/posts/default/7652881742423250247'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8665317644900962850/posts/default/7652881742423250247'/><link rel='alternate' type='text/html' href='http://structlearn.blogspot.com/2007/06/frustratingly-hard-domain-adaptation.html' title='Frustratingly Hard Domain Adaptation for Parsing'/><author><name>Fernando Pereira</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh4.googleusercontent.com/-m0RNMya3Dlw/AAAAAAAAAAI/AAAAAAAAAAA/LZqK1bhE0Vg/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8665317644900962850.post-8064718323719058099</id><published>2007-06-16T12:19:00.001-07:00</published><updated>2007-06-16T12:24:09.114-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='domain adaptation'/><category scheme='http://www.blogger.com/atom/ns#' term='sentiment classification'/><title type='text'>Biographies, Bollywood, Boom-boxes, and Blenders: Domain Adaptation for Sentiment Classification</title><content type='html'>&lt;pre&gt;@inproceedings{Blitzer07Biographies,&lt;br /&gt;author = {John Blitzer and Mark Dredze and Fernando Pereira},&lt;br /&gt;title = {Biographies, Bollywood, Boom-boxes and Blenders: Domain Adaptation for Sentiment Classification},&lt;br /&gt;booktitle = "Association for Computational Linguistics",&lt;br /&gt;address = "Prague, Czech Republic"&lt;br /&gt;year = "2007"&lt;br /&gt;}&lt;/pre&gt;&lt;br /&gt;&lt;p&gt;We will presenting &lt;a href="http://www.cis.upenn.edu/~blitzer/papers/sentiment_domain.pdf"&gt;this&lt;/a&gt; at ACL the week after next.&lt;/p&gt;&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8665317644900962850-8064718323719058099?l=structlearn.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://structlearn.blogspot.com/feeds/8064718323719058099/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8665317644900962850&amp;postID=8064718323719058099' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8665317644900962850/posts/default/8064718323719058099'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8665317644900962850/posts/default/8064718323719058099'/><link rel='alternate' type='text/html' href='http://structlearn.blogspot.com/2007/06/biographies-bollywood-boom-boxes-and.html' title='Biographies, Bollywood, Boom-boxes, and Blenders: Domain Adaptation for Sentiment Classification'/><author><name>Fernando Pereira</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh4.googleusercontent.com/-m0RNMya3Dlw/AAAAAAAAAAI/AAAAAAAAAAA/LZqK1bhE0Vg/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8665317644900962850.post-686941167448425042</id><published>2007-06-16T12:16:00.001-07:00</published><updated>2007-06-16T12:24:26.506-07:00</updated><category scheme='http://www.blogger.com/atom/ns#' term='learning theory'/><title type='text'>Occam's Hammer</title><content type='html'>John Langford &lt;a href="http://hunch.net/?p=272"&gt;blogged about&lt;/a&gt; &lt;a href="http://cvlab.epfl.ch/~fleuret/papers/blanchard-fleuret-colt2007.pdf"&gt;Occam's Hammer&lt;/a&gt;, which I've started reading. I agree with John that this is an interesting new way of proving tight generalization bounds, which is on my mind because of some papers we submitted for publication recently.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8665317644900962850-686941167448425042?l=structlearn.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://structlearn.blogspot.com/feeds/686941167448425042/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8665317644900962850&amp;postID=686941167448425042' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8665317644900962850/posts/default/686941167448425042'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8665317644900962850/posts/default/686941167448425042'/><link rel='alternate' type='text/html' href='http://structlearn.blogspot.com/2007/06/occam-hammer.html' title='Occam&amp;#39;s Hammer'/><author><name>Fernando Pereira</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh4.googleusercontent.com/-m0RNMya3Dlw/AAAAAAAAAAI/AAAAAAAAAAA/LZqK1bhE0Vg/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry><entry><id>tag:blogger.com,1999:blog-8665317644900962850.post-5181567896704002881</id><published>2007-06-16T12:06:00.000-07:00</published><updated>2007-06-16T12:10:56.423-07:00</updated><title type='text'>What's new in my research?</title><content type='html'>I'll be blogging here what research papers I am reading and what I have written, with very little commentary. I don't know if it will be useful, but I would find it very useful if researchers I follow had such blogs, so I decided to do my bit.&lt;div class="blogger-post-footer"&gt;&lt;img width='1' height='1' src='https://blogger.googleusercontent.com/tracker/8665317644900962850-5181567896704002881?l=structlearn.blogspot.com' alt='' /&gt;&lt;/div&gt;</content><link rel='replies' type='application/atom+xml' href='http://structlearn.blogspot.com/feeds/5181567896704002881/comments/default' title='Post Comments'/><link rel='replies' type='text/html' href='http://www.blogger.com/comment.g?blogID=8665317644900962850&amp;postID=5181567896704002881' title='0 Comments'/><link rel='edit' type='application/atom+xml' href='http://www.blogger.com/feeds/8665317644900962850/posts/default/5181567896704002881'/><link rel='self' type='application/atom+xml' href='http://www.blogger.com/feeds/8665317644900962850/posts/default/5181567896704002881'/><link rel='alternate' type='text/html' href='http://structlearn.blogspot.com/2007/06/whats-new-in-my-research.html' title='What&apos;s new in my research?'/><author><name>Fernando Pereira</name><email>noreply@blogger.com</email><gd:image rel='http://schemas.google.com/g/2005#thumbnail' width='32' height='32' src='//lh4.googleusercontent.com/-m0RNMya3Dlw/AAAAAAAAAAI/AAAAAAAAAAA/LZqK1bhE0Vg/s512-c/photo.jpg'/></author><thr:total>0</thr:total></entry></feed>
