Center for Language Engineering






[ Text Corpora ] [ Image Corpora ] [ Lexical Resources ] [ NLP Applications ]


[ How to Order ]


CLE is making these linguistic resources available without cost for supporting academic, non-commercial research. The processing fees being charged will be used to maintain these resources. You are requested to contact CLE directly for any discounts (applicable only for selective public organizations in Pakistan) or for commercial licensing options.

  CLE Urdu POS Tagger [ Pakistan ] [ International ]
CLE Catalog #: CLE14A002
Release Date: 18 November 2014
Language(s): Urdu
Application Type: API
Platform: JAVA
Distribution: Web Download
Processing Fee (Pakistan): 15000 PKR
Processing Fee (International): 250 USD
License: Yes
  CLE Urdu Part of Speech (POS) Tagger assigns POS tags such as noun, verb, adjective and adverb to each word/token of the input text. After detailed analysis of Urdu text, complete tagset for Urdu has been defined and reported. This tagger is trained using CLE Urdu Digest POS Tagged Corpus 100K and gives a tagging accuracy of 96.8%. For details see:
  1. CLE Urdu POS Tagset
  2. CLE Urdu Parts of Speech (POS) Tagset
  The package of CLE Urdu POS Tagger contains:
  1. CLE Urdu POS Tagger API
  2. CLE Urdu POS Tagger API - Release Notes
  3. CLE Urdu Parts of Speech (POS) Tagset
  The minimum hardware requirements for this application are: Pentium-compatible CPU 2.8 GHz and 1 GB RAM. This application requires Windows XP, Windows Vista or Windows 7 platform with Java Runtime Environment 7.0.
   دنیا کا ہر فرد کامیابی کا آرزومند ہے۔ ناکامی سے سب گھبراتے ہیں۔ عزت، دولت، راحت اور عافیت کی زندگی کے سبھی شیدائی ہیں۔

: ان پٹ

   دنیا/NN کا/PSP ہر/JJ فرد/NN کامیابی/NN کا/PSP آرزومند/NN ہے/VBF ۔/PU ناکامی/NN سے/PSP سب/Q گھبراتے/VBF ہیں/AUXT ۔/PU عزت/NN ،/PU دولت/NN ،/PU راحت/NN اور/CC عافیت/NN کی/PSP زندگی/NN کے/PSP سبھی/Q شیدائی/NN ہیں/VBF ۔/PU

: آؤٹ پٹ

  Online Urdu POS Tagger