The Collection of Distributionally Idiosyncratic Items (CoDII)
CoDII was funded by the Deutsche Forschungsgemeinschaft and the Landesstiftung Baden-Württemberg.
Purpose and Design of CoDII
The Collection of Distributionally Idiosyncratic Items (CoDII) is a linguistic resource on lexical items which have highly idiosyncratic occurrence patterns.
CoDII was initiated within the project A5 Distributional Idiosyncrasies of the Collaborative Research Centre (Sonderforschungsbereich) 441 at the University of Tübingen.
Collaborators
The following people have contributed to the design and the compilation of CoDII.
Subcollections of CoDII
- CoDII-BW.de
(Collection of Distributionally Idiosyncratic Items: German Bound Words / Sammlung unikaler Wörter des Deutschen, SuWD)
CoDII-BW.de contains information on 446 German bound words.
- CoDII-BW.en
(Collection of Distributionally Idiosyncratic Items: English Bound Words)
CoDII-BW.en contains information on 77 English bound words.
- CoDII-NPI.de
(Collection of Distributionally Idiosyncratic Items: Negative Polarity Items in German)
CoDII-NPI.de contains information on 165 German Negative Polarity Items.
- CoDII-NPI.ro
(Collection of Distributionally Idiosyncratic Items: Negative Polarity Items in Romanian)
CoDII-NPI.ro contains information on 58 Romanian Negative Polarity Items.
- CoDII-PPI.de
(Collection of Distributionally Idiosyncratic Items: Positive Polarity Items in German)
CoDII-PPI.de contains information on 88 German Positive Polarity Items.
Publications
- Jan-Philipp Soehn, Mingya Liu, Beata Trawinski and Gianina Iordachioaia (To appear).
Positive und Negative Polaritätselemente als lexikalische Einheiten mit Distributionsidiosynkrasien
[Positive and Negative Polarity Items as Lexical Units with Distributional Idiosyncrasies]. In
Proceedings of Europhras 2008.
- Beata Trawinski, Manfred Sailer, Jan-Philipp Soehn, Lothar Lemnitzer and Frank Richter (2008).
Cranberry Expressions in English and in German. In
Proceedings of the LREC Workshop Towards a Shared Task for Multiword Expressions (MWE 2008),
pp. 35-38. European Language Resources Association (ELRA): Marrakech, Morocco.
- Beata Trawinski and Jan-Philipp Soehn (2008).
A Multilingual Database of Polarity Items. In
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08).
European Language Resources Association (ELRA): Marrakech, Morocco.
- Beata Trawinski, Jan-Philipp Soehn, Manfred Sailer and Frank Richter (2008).
A Multilingual Electronic Database of Distributionally Idiosyncratic Items.
In Elisenda Bernal and Janet DeCesaris (Eds.),
Proceedings of the XIII Euralex International Congress,
Series: Activitats, Volume 20, pp. 1445-1451. Universitat Pompeu Fabra: Barcelona, Spain.
- Manfred Sailer and Beata Trawinski (2006).
Die Sammlung unikaler Wörter des Deutschen. Aufbauprinzipien und erste Auswertungsergebnisse
[The Collection of German Bound Words. Design Principles and First Evaluation].
In Annelies Häcki Buhofer and Harald Burger (Eds.), Phraseology in Motion I. Methoden und Kritik. Akten der Internationalen Tagung zur Phraseologie (Basel, 2004),
Series: Phraseologie und Parämiologie, Volume 19, pp. 439-450. Hohengehren: Schneider Verlag.
- Manfred Sailer and Beata Trawinski (2006).
The Collection of Distributionally Idiosyncratic Items: A Multilingual Resource for Linguistic Research. In
Proceedings of the 5th International Conference on Language Resources and Evaluation, LREC 2006,
pp. 471-474. Genoa, Italy.
Presentations
- August 13-16, 2008
Jan-Philipp Soehn, Mingya Liu, Beata Trawinski and Gianina Iordachioaia:
Positive und Negative Polaritätselemente als lexikalische Einheiten mit Distributionsidiosynkrasien
[Positive and Negative Polarity Items as Lexical Units with Distributional Idiosyncrasies].
Talk given at Europhras 2008,
Helsinki, Finland.
- July 15-19, 2008
Beata Trawinski, Jan-Philipp Soehn, Manfred Sailer and Frank Richter:
A Multilingual Electronic Database of Distributionally Idiosyncratic Items.
Poster presented at the XIII Euralex International Congress,
Barcelona, Spain.
- June 1, 2008
Beata Trawinski, Manfred Sailer, Jan-Philipp Soehn, Lothar Lemnitzer and Frank Richter:
Cranberry Expressions in English and in German.
Talk given at the LREC Workshop Towards a Shared Task for Multiword Expressions (MWE 2008),
Marrakech, Morocco.
- May 28-30, 2008
Beata Trawinski and Jan-Philipp Soehn:
A Multilingual Database of Polarity Items.
Poster presented at the Sixth International Conference on Language Resources and Evaluation (LREC'08), Marrakech, Morocco.
- December 15, 2007
Mingya Liu, Frank Richter, Jan-Philipp Soehn and Beata Trawinski:
Distributionsidiosynkrasien in der Logischen Form [Distributional Idiosyncrasies in Logical Form].
Project report presented at the SFB-Tag, Tübingen, Germany.
- April 11-13, 2007
Beata Trawinski, Jan-Philipp Soehn and Frank Richter:
Modeling Distributionally Idiosyncratic Items in XML.
Poster presented at GLDV Conference 2007 (Biannual Conference of the Society for Computational Linguistics and Language Technology), Tübingen, Germany.
- March 8-10, 2007
Frank Richter, Jan-Philipp Soehn and Beata Trawinski:
Spotting, Collecting and Documenting NPIs.
Talk given at the Workshop on Negation and Polarity, Tübingen, Germany.
- October 5-7, 2006
Manfred Sailer: Modeling the Lexis-Grammar Interface in a Competence-Based Framework: The Case of Bound Words.
Presentation at Exploring the Lexis-Grammar Interface (ELeGI 2006), Hannover, Germany.
- May 22-28, 2006
Manfred Sailer and Beata Trawinski:
The Collection of Distributionally Idiosyncratic Items: A Multilingual Resource for Linguistic Research.
Talk given at the 5th International Conference on Language Resources and Evaluation, LREC 2006, Genoa, Italy.
- December 20, 2004
Beata Trawinski: The Collection of Distributionally Idiosyncratic Items.
Invited talk given at the Institute of Computer Science of the Polish Academy of Sciences in Warsaw, Poland.
- August 26-29, 2004
Manfred Sailer and Beata Trawinski:
Die Sammlung unikaler Wörter des Deutschen. Aufbauprinzipien und erste Auswertungsergebnisse
[The Collection of German Bound Words. Design Principles and First Evaluation].
Talk given at EUROPHRAS 2004: Europäische Gesellschaft für Phraseologie
at the University of Basel, Switzerland.