Transcript ALR 2013
ALR 2013 Some observations Pushpak Bhattacharyya, ALR Chair Workshop Overview Contribution to the workshop in terms of Research papers Resource (paradigmatic) -EVB, COW Annotation (syntagmatic) Kind -Keynote speech** -Etymological -valence alternation and structure -event and event actor alignment Applications (contributes to resource) -Hindi SA -Malayalam NER -UNLization of Punjabi Agreement -legitimate disagreement -agreement tracking byt ET Suggestions at the conclusion of the workshop by participants • • • • Asian equivalent of ELRA (Europe), LDC (USA) Validation of data a concern Lots of tools getting reinvented, duplicated The catalogue of ALR should be revived Good progress on LR development in India: Indowordnet Noun Verb Adjective Adverb Total Hindi 28520 3144 6114 465 38243 Assamese 9065 1676 3805 412 14958 Bengali 27281 2804 5815 445 36346 Bodo 8788 2296 4287 414 15785 Gujarati 26503 2805 5828 445 35599 Kannada 11146 1642 3056 171 16016 Kashmiri 21041 2660 5365 400 29469 Konkani 23144 3000 5744 482 32370 Malayalam 7487 1143 3060 418 12109 Manipuri 10156 2021 3806 332 16351 Marathi 19902 2780 4968 502 28153 Nepali 6748 1477 3227 261 11713 Odiya 27216 2418 5273 377 35284 Punjabi 23255 2836 5830 443 32364 Sanskrit 17413 1246 3990 263 22934 Tamil 11190 2803 5827 477 20308 Telugu 11532 2785 5661 455 20433 Urdu 21979 2800 5786 443 33268