Center for Language Engineering

 
 



 

 



 
 


 
 


 
   
 

CLE makes these linguistic resources available without cost to support academic, non-commercial research. The processing fees charged are used to maintain the resources. Please contact CLE directly about discounts (applicable only to selected public organizations in Pakistan) or about commercial licensing options.

 
     
  CLE Urdu Phrase Chunker
   
 
CLE Catalog #: CLE17A005
Release Date: 24 May 2017
Language(s): Urdu
Application Type: API
Platform: Python
Distribution: Web Download
Processing Fee (Pakistan): 30000 PKR
Processing Fee (International): 250 USD
License: Yes
   
  Introduction
  The CLE Urdu Chunker (IOB tagger) assigns IOB tags to mark syntactic phrases such as noun phrases (NPs), verb phrases (VPs), postpositional phrases (PPs) and prepositional phrases (PrPs). The chunker accepts POS-tagged text and outputs IOB-tagged text. Each word in the output carries a POS tag and an IOB tag, in the form word/POS/IOB. The chunker is trained on the CLE Urdu Digest IOB Tagged Corpus 100K and achieves a tagging accuracy of 97.06% on a 10% manually tagged test set. For Urdu POS tagging, please see: Urdu POS Tagger
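
  The word/POS/IOB output format described above can be post-processed into phrase-level chunks. The following is a minimal sketch only: it does not use the CLE API, the function names `parse_token` and `group_chunks` are hypothetical, and it assumes IOB tags are written as `B-NP`, `I-NP` and `O` and that tokens are separated by whitespace. The actual tag spelling and token separator are documented in the release notes and may differ.

```python
def parse_token(token):
    """Split one 'word/POS/IOB' token into its three fields."""
    # Split from the right so that a '/' inside the word itself is preserved.
    word, pos, iob = token.rsplit("/", 2)
    return word, pos, iob


def group_chunks(tagged_text):
    """Group whitespace-separated word/POS/IOB tokens into (label, words) pairs.

    Assumes (hypothetically) that IOB tags look like 'B-NP', 'I-NP' or 'O'.
    """
    chunks = []
    current_label, current_words = None, []
    for token in tagged_text.split():
        word, _pos, iob = parse_token(token)
        starts_chunk = iob.startswith("B-") or (
            iob.startswith("I-") and iob[2:] != current_label
        )
        if starts_chunk:
            # Close any open chunk, then open a new one with this word.
            if current_words:
                chunks.append((current_label, current_words))
            current_label, current_words = iob[2:], [word]
        elif iob.startswith("I-"):
            # Continue the chunk that is currently open.
            current_words.append(word)
        else:
            # An 'O' tag (outside any phrase) closes the open chunk.
            if current_words:
                chunks.append((current_label, current_words))
            current_label, current_words = None, []
    if current_words:
        chunks.append((current_label, current_words))
    return chunks


# Placeholder tokens and POS tags, not real chunker output.
sample = "w1/NN/B-NP w2/NN/I-NP w3/PSP/B-PP w4/VB/B-VP w5/PU/O"
for label, words in group_chunks(sample):
    print(label, " ".join(words))
```

  Splitting from the right keeps words that themselves contain a slash intact, which matters for dates and similar tokens.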
   
  Package
  The package of CLE Urdu Phrase Chunker contains:
  1. CLE Urdu IOB Tagger API
  2. CLE Urdu IOB Tagger API - Release Notes
   
  System Requirements
  The minimum hardware requirements for this application are a Pentium-compatible 2.8 GHz CPU and 1 GB of RAM. The application requires Linux (the chunker was trained and tested on Ubuntu 14.04 LTS) with Python 2.7.
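
  The software requirements above can be checked before installation with a short script. This is a sketch under stated assumptions, not part of the CLE package: `check_environment` is a hypothetical helper, and it checks only the interpreter version and operating system, not the CPU or memory.

```python
import platform
import sys


def check_environment(version_info=None, system=None):
    """Return a list of mismatches against the stated requirements.

    Defaults to the running interpreter and OS; both can be passed in
    explicitly, which makes the check easy to exercise.
    """
    version_info = sys.version_info if version_info is None else version_info
    system = platform.system() if system is None else system
    problems = []
    if tuple(version_info[:2]) != (2, 7):
        problems.append(
            "Python 2.7 is required, found %d.%d" % (version_info[0], version_info[1])
        )
    if system != "Linux":
        problems.append("Linux is required, found %s" % system)
    return problems


for problem in check_environment():
    print(problem)
```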
   
  Sample
  Input
 
 
  Output
 
   
 
 
 

webmaster@cle.org.pk