The 10th Workshop on Asian Language Resources

 
 


Mumbai, India
09 December, 2012

 


 

Program
 

Session 1 - Linguistics Resources
Chair: Ruvan Weerasinghe

9:00

Korean NLP2RDF Resources

YoungGyun Hahm, KyungTae Lim, Jungyeul Park, Yongun Yoon, Key-Sun Choi
KAIST, Les Editions an Amzer Vak

9:30

Building Large Scale Text Corpus for Tibetan Natural Language Processing by Extracting Text from Web Pages

Huidan Liu, Minghua Nuo, Jian Wu, Yeping He
Institute of Software,Chinese Academy of Sciences

10:00

A Structured Approach for Building Assamese Corpus: Insights, Applications and Challenges

Prof. Shikhar Kr. Sarma, Himadri Bharali, Ambeswar Gogoi, Ratul Deka, Anup Kr Barman
Gauhati University, Assam University

10:30

Corpus Building of Literary Lesser Rich Language-Bodo: Insights and Challenges

Biswajit Brahma, Anup Kr. Barman, Prof. Shikhar Kr. Sarma, Bhatima Boro
Gauhati University

11:00

Tea Break

Session 2 - Morphology and Syntax Parsing
Chair: Sarmad Hussain

11:30

Dependency Parsers for Persian

Mojgan Seraji, Beata Megyesi, Joakim Nivre
Department of Linguistics and Philology, Uppsala University

12:00

A New DOP Model for Phrase-structure Parsing of Persian Sentences

Zahra Sarabi and Morteza Analoui
Iran University of Science and Technology, Faculty member of Iran University of Science and Technology

12:30

A Hybrid Dependency Parser for Bangla

Arnab Dhar, Sanjay Chatterji, Sudeshna Sarkar, Anupam Basu
IIT Kharagpur

13:00

Repairing Bengali Verb Chunks for Improved Bengali to Hindi Machine Translation

Sanjay Chatterji, Nabanita Datta, Arnab Dhar, Biswanath Barik, Sudeshna Sarkar, Anupam Basu
IIT kharagpur

13:30

Lunch Break

Session 3 - Knowledge Extraction
Chair: Virach Sornlertlamvanich

14:30

Domain Specific Ontology Extractor For Indian Languages

Brijesh Bhatt and Pushpak Bhattacharyya
IIT, Bombay

15:00 Constrained Hidden Markov Model for Bilingual Keyword Pairs Alignment Denny Cahyadi, Fabien Cromieres, Sadao Kurohashi Kyoto University
15:30 N-gram and Gazetteer List Based Named Entity Recognition for Urdu: A Scarce Resourced Language Faryal Jahangir, Waqas Anwar, Usama Ijaz Bajwa, Xuan Wang
COMSATS Institute of Information Technology, Harbin
Institute of Technology, Shezhen Graduate School, China

16:00

Tea Break

Session 4 - Applications
Chair: Rachel Roxas

16:30

Developing a POS tagger for Magahi: A Comparative Study

Ritesh Kumar, Bornini Lahiri, Deepak Alok
Jawaharlal Nehru University, India

17:00 Enhancing Lemmatization for Mongolian and its Application to Statistical Machine Translation Chimeddorj Odbayar and Atsushi Fujii
Tokyo Institute of Technology
17:30 The Hindi Pronominal Correlatives in Bengali Sanjay Chatterji, Sudeshna Sarkar, Anupam Basu
IIT Kharagpur

webmaster@cle.org.pk