Urdu morphological parser
Urdu Morphological Analyzer & Generator, developed using Functional Morphology (FM) as a Master's thesis.
Download:
An implementation of Urdu morphology as an open-source software API comprising:
  A type system that covers language abstraction;
  An inflection engine that covers word-and-paradigm morphological rules for Urdu;
  Rules for automatic lexicon extraction using the extract tool.
A lexicon of 4163 words and 96840 word forms.
A manual for users/lexicographers to add new words.
An implementation of a small part of Urdu syntax in Grammatical Framework.
A Unicode infrastructure for the Urdu morphology API.
By mcswell in Languages > Indic Languages > Indo-Aryan > Urdu, with morphology, parser, urdu
HtmlCleaner
HtmlCleaner is an open-source HTML parser written in Java. Given an HTML document, HtmlCleaner reorders individual elements and produces well-formed XML. By default, it follows rules similar to those that most web browsers use to create the Document Object Model.
cleaner, html, htmlcleaner, parser, tidy, xml
By HarryManback in Software > tools, with html, java, parser, xml
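
To give a feel for what "produces well-formed XML" means in the HtmlCleaner entry above, here is a minimal sketch in Python using only the standard library. It is not HtmlCleaner's code or algorithm, and it is far simpler than a browser-style cleaner: it only balances tags, self-closes a few void elements, and escapes text. The names TagBalancer, to_well_formed, and is_well_formed are hypothetical and chosen for this illustration.

```python
from html.parser import HTMLParser
from xml.sax.saxutils import escape, quoteattr
import xml.etree.ElementTree as ET

# Elements that never have a closing tag in HTML (a partial list for the sketch).
VOID = {"br", "hr", "img", "input", "meta", "link"}


class TagBalancer(HTMLParser):
    """Hypothetical helper: re-emits HTML with every element properly closed."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.out = []    # output fragments
        self.stack = []  # names of currently open elements

    def handle_starttag(self, tag, attrs):
        # dict() drops duplicate attribute names, which XML does not allow.
        attr_text = "".join(
            f" {name}={quoteattr(value or '')}"
            for name, value in dict(attrs).items()
        )
        if tag in VOID:
            self.out.append(f"<{tag}{attr_text}/>")
        else:
            self.out.append(f"<{tag}{attr_text}>")
            self.stack.append(tag)

    def handle_endtag(self, tag):
        if tag not in self.stack:
            return  # stray closing tag: ignore it
        # Close any elements left open inside the one being closed.
        while self.stack:
            open_tag = self.stack.pop()
            self.out.append(f"</{open_tag}>")
            if open_tag == tag:
                break

    def handle_data(self, data):
        self.out.append(escape(data))


def to_well_formed(html):
    """Return the markup with all tags balanced (assumes a single root element)."""
    parser = TagBalancer()
    parser.feed(html)
    parser.close()  # flush any buffered text
    while parser.stack:
        parser.out.append(f"</{parser.stack.pop()}>")
    return "".join(parser.out)


def is_well_formed(xml_text):
    """Check well-formedness by attempting to parse with ElementTree."""
    try:
        ET.fromstring(xml_text)
        return True
    except ET.ParseError:
        return False
```

For example, to_well_formed("<div><p>One<br><b>two</p></div>") closes the dangling b element and self-closes br, so the result parses as XML. A real cleaner such as HtmlCleaner also reorders misplaced elements according to HTML rules, which this sketch does not attempt.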