Helping to find the usefulness of a proposal

Joe Darcy Joe.Darcy at Sun.COM
Thu Apr 2 19:08:43 PDT 2009


brucechapman at paradise.net.nz wrote:
> Good idea,
> 
> Should they each be evaluated against the same corpus and what would 
> be a suitable corpus?
> 
> http://en.wikipedia.org/wiki/Corpus_linguistics

On that front, Alex sent me the following:

> Analysis of a micro-corpus of your own or your company's code is
> unscientific.
> 
> Ewan Tempero and his colleagues at the University of Auckland have 
> done excellent, peer-reviewed work on how Java language features are 
> used in real-world code. Their "Qualitas Corpus" consists of over 
> 100,000 classes - see http://www.cs.auckland.ac.nz/~ewan/corpus/
> 
> If someone showed that, say, a null check occurs on average every 15 
> lines of code in this corpus, and that null-safe operators could 
> remove those lines without adverse side effects, then that would be a
> real contribution to Project Coin.

I agree that using a standard, large corpus to empirically examine the 
utility of the Project Coin proposals would be a fine component of their 
evaluation.

-Joe



More information about the coin-dev mailing list