|  | 
 Invited Talk
Harvesting Linguistic Evidence from the Web with the LSE:The Case of the Benefactive Alternation
We examine the the English Benefactive Alternation (BA)
exemplified in (1) and (2):
 
(1a) Kim baked/bought/stole a cake for Chris(1b) Kim baked/bought/stole Chris a cake
 
 (2a) Kim decorated/frosted a cake for Chris
 (2a) *Kim decorated/frosted Chris a cake
 
 
The double object alternant, as in (1b, 2b), is highly constrained. 
Explications of the constraints, primarily in terms of lexico-semantic 
features, have been formulated by Green (1978) inter alia. 
But it turns out that these constraints are squishy. There exist both 
inter- and intra-speaker variation in the judgments of many Benefactive 
constructions. 
Rather than rely on speakers' inconclusive intuitions, we decided 
to examine the distribution of the BA by looking at the largest available 
collection of unsolicited data, the web. We find that most of the 
constraints posited for the BA on the basis of constructed data do not 
hold, and that the BA is in fact far less restricted than the literature 
would suggest. 
The attested data indicate one single necessary, but not sufficient, 
constraint on the alternation. 
 
To enable future empirical web-based work, we developed a tool called 
the Linguistic Search Engine, or LSE (Resnik, 2003), which allows us 
to search efficiently for specific target structures and lexical-syntactic 
patterns. We demonstrate the use of the LSE and briefly discuss the 
implications of using web data for linguistic theory.
 
This is joint work with Philip Resnik, Mari Olsen, and Aaron Elkiss. 
 |